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FIELD OF THE INVENTION 

The present invention relates to CEA binding polypeptides and compositions for 
detection and treatment of cancer. More particularly, the invention relates to materials 
useful for and methods of detecting, imaging, localizing, and targeting tumors exhibiting 

25 CEA. The invention provides binding polypeptides capable of associating specifically 
with CEA and of distinguishing between CEA and known cross-reactive antigens, such as 
NCA (non-specific cross-reacting antigen). Such binding polypeptides are useful for the 
detection, imaging, localization, and targeting of CEA-containing tissues or solutions, 
e.g., by radioimaging, magnetic resonance imaging, or x-ray imaging, and are also useful 

30 in the diagnosis and treatment of cancers associated with CEA. 

FEDERAL FUNDING 

The present invention was partly supported by STTR grant no. 1R41CA79230-01 . 
As a result, the U.S. government retains certain rights in the invention. 
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BACKGROUND OF THE INVENTION 

Carcinoembryonic antigen, or CEA, is a complex immunoreactive glycoprotein 
with a molecular weight of 1 80,000 found in adenocarcinomas of endodermally derived 
5 digestive system epithelia and fetal colon. Tumor cells at many sites, including colon, 
breast, lung, cervix, ovary, stomach, bladder, pancreas and esophagus express large 
amounts of carcinoembryonic antigen and/or the closely related immunoglobulin 
supergene family member, nonspecific cross-reactive antigen, or NCA, on their surfaces. 
The expression of these glycoproteins, especially CEA, in normal cells is very limited in 

10 mature individuals (as opposed to prenatal infants), and this antigen has been used as a 
target in immunoassays for diagnosis and for serially monitoring cancer patients for 
recurrent disease or response to therapy. (See, Mach et al., Immun. Today, 2: 239, 1981; 
Berche et al, Br. Med. J., 285: 1447, 1982.) Anti-CEA antibodies also have been 
proposed for cancer therapy and for use in forming immunoconjugates which in turn can 

15 be used in cancer therapy. (See, e.g., Buchegger et al., U.S. 5,047,507 (1991); Osbourne 
etal. U.S. 5,872,215 (1999).) 

CEA was as first described by Gold and Freedman, J. Exp. Med., 121 : 439, 1965, 
and has now been completely sequenced and characterized (see, Beauchemin et al, Mol 
Cell Biol, 7:3221-30, 1987; WO 95/06067). CEA has a domain structure of N-A1-B1- 

20 A2-B2-A3-B3-GPI where GPI is a glycophosphatidylinositol membrane anchor. A 
significant degree of sequence homology exists between the domains of CEA and other 
members of the immunoglobulin supergene family, and immunological cross-reactivity 
between CEA and as many as sixteen other homologous antigens, such as NCA and 
biliary glycoprotein- 1 (BGP-1), has been reported. 

25 One of the major drawbacks of the use of anti-CEA antibodies for clinical 

purposes has been the cross-reactivity of these antibodies with some apparently normal 
adult tissues. Previous studies have shown that most conventional hyperimmune antisera 
raised against different immunogenic forms of CEA cross-react with CEA-related 
antigens found in normal colonic mucosa, spleen, liver, lung, sweat glands, 
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polymorphonuclear leukocytes and monocytes of normal individuals, as well as many 
different types of carcinomas. 

Accordingly, there is a great need for binding moieties that bind to CEA but do 
not cross-react with other antigens such as NCA. This and other objects are 
5 accomplished herein with the discovery of novel peptide binders of CEA. 

SUMMARY OF THE INVENTION 

The present invention addresses the need for improved materials and methods for 
detecting, localizing, measuring and treating CEA-expressing cells by providing a group 

10 of non-naturally occurring polypeptides that bind specifically to CEA. Appropriate 
labeling of such polypeptides provides detectable imaging agents that bind at high 
concentration to a CEA-expressing tumor, providing excellent tumor-specific imaging 
agents. Conjugation or fusion of such polypeptides with effective agents such as 
cytokines, chemotherapeutic agents, radionuclides or other cancer therapeutics produce 

15 conjugates that can be used for cancer therapy, i.e., by causing the conjugate to target the 
site of a tumor that is producing CEA. Recombinant bacteriophage displaying the CEA- 
binding polypeptides of the invention have been identified and isolated, and such phage 
products are also valuable reagents for effective detection and diagnosis of cancers 
associated with the expression of CEA in cells and tissues. The CEA binding moieties of 

20 the instant invention can be used in the detection, diagnosis, and therapy of such CEA- 
related disorders. 

This invention pertains to CEA binding moieties. Binding moieties according to 
this invention are useful in any application where binding, detecting or isolating CEA or 
its fragments is advantageous. A particularly advantageous use of the binding moieties 

25 disclosed herein is in a method of imaging cells or tissues expressing CEA in vivo. The 
method entails the use of CEA specific binding moieties according to the invention for 
detecting CEA-expressing cells, where the binding moieties have been detectably labeled 
for use as imaging agents, including magnetic resonance imaging (MRI) contrast agents, 
x-ray imaging agents, radiopharmaceutical imaging agents, ultrasound imaging agents, 

30 and optical imaging agents. 
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Preferred CEA binding moieties according to the invention are isolated, synthetic 
polypeptides having a high affinity for CEA. This invention provides a new class of 
CEA binding polypeptides having an amino acid sequence comprising: 

X-X 2 -X 3 -Cys-X 4 -X 5 ^ (SEQ ID NO:l), 

5 wherein 

Xj is Asn, Asp, or is absent; 
X 2 is Trp; 

X 3 is Asp, Phe, or Val; 

X 4 is Asn, Glu, or Met; 
10 X 5 is Asn, Leu, Met or Phe; 

X 6 is Asp, Gly, He, Lys, Phe or Thr; 

X 7 is Ala, Gin, Gly, Lys, or Thr; 

X 8 is Arg, Asn, Asp, Glu, or Gly; 

X 9 is Gin, Gly, or Leu; 
15 X 10 is Ala, Trp or Tyr; 

X n is Ala, Gly, His, Phe, Thr, or Val; 

X 12 is Asn, Gin, Phe, Ser or Val; 

X 13 is Arg, Leu, Pro or Ser; and 

X 14 is Leu, Ser, Trp or Tyr; 
20 and wherein said polypeptide has the ability to bind CEA. Said polypeptide may have 

additional amino acids attached at either end. Peptides having a serine at the N-terminus 

(before X x ) are preferred embodiments. 

Preferred CEA binding polypeptides of the above formula will have the amino 

acid sequence: 

25 X-Trp-Val-Cys-Glu-X 5 -X 6 -Lys- X 8 -Gln-Trp- X n -Cys-Asn-X 13 -X 14 (SEQ ID 

NO:2), wherein 

X x is Asn or Asp; 

X 5 is Asn, Leu, Met or Phe; 

X 6 is Asp, Gly, lie, Lys, Phe or Thr; 
30 X 8 is Arg, Asn, Asp, Glu, Gly or Trp; 



4 



X„ is Ala, Gly, His, Phe, Thr, Tyr or Val; 
X 13 is Arg, Leu, Pro or Ser; and 
X 14 is Leu, Ser, Trp or Tyr; 

In particular, a stable binding loop having a high affinity for CEA is disclosed, 
5 having the formula: Cys-X^Xs-Xg-Xy-Xg-Xg-Xjo-Xn-Cys (SEQ ID NO: 3), 
wherein 

X 4 is Asn, Glu, or Met; 

X 5 is Asn, Leu, Met or Phe; 

X 6 is Asp, Gly, He, Lys, Phe or Thr; 
10 X 7 is Ala, Gin, Gly, Lys, or Thr; 

X 8 is Arg, Asn, Asp, Glu, or Gly; 

Xg is Gin, Gly, or Leu; 

X 10 is Ala, Trp or Tyr; 

X n is Ala, Gly, His, Phe, Thr, or Val; 
15 and wherein it is preferred that 

X 4 is Glu; 

X 5 is Asn, Leu, Met or Phe; 
X 6 is Asp, Gly, He, Lys, Phe or Thr; 
X 7 is Lys; 
20 X 8 is Arg, Asn, Asp, Glu, or Gly; 
X 9 is Gin; 
X 10 is Trp; and 

X u is Ala, Gly, His, Phe, Thr, or Val. 

Preferred polypeptides according to the invention comprise an amino acid 
25 sequence: 

Asn-Trp-Val-Cys-Asn-Leu-Phe-Lys-Asn-Gln-Trp-Phe^ys-Asn-Ser-Tyr (SEQ ID 
NO:4)(TN10/9-G08), 

Asp-Trp-Val-Cys^lu-Asn^ (SEQ ID 

NO:5)(TN10/9-A07), 
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Asn-Trp-Asp^ys-Met-Phe-Gly-Ala-Glu-G^ (SEQ ID 

NO:6)(TN10/9-E01), 

Asp-Trp-Val^ys-Glu-Lys-Thr^ (SEQ ID 

NO:7)(TN10/9-B09), 

5 Asn-Trp-Phe-Cys-Glu-Met-Ile-Gly™^ (SEQ ID 

NO:8)(TN10/9-Fll),and 

Asp-Trp-Val-Cys-Asn-Phe-As^ (SEQ ID 

NO:9)(TN10/9-D04). 

The most preferred CEA binding moieties according to the invention are isolated, 
10 synthetic polypeptides having a high affinity for CEA. This invention provides a new 
class of CEA binding polypeptides having an amino acid sequence comprising: 

X 1 -X 2 -X 3 -Cys-X 4 -X 5 -X 6 -X 7 -X 8 -X 9 -X 10 -X 11 -Cys-X 12 -X 13 -X 14 (SEQ ID NO:l), 
wherein 

X l is Asp, Asn, Ala, or He, with Asp most preferred; 
15 X 2 isTrp; 

X 3 is Val, He, Met, Tyr, Phe, Pro, or Asp, with Val most preferred; 
X 4 is Asn, Glu, or Asp, with Asn and Glu most preferred; 
X 5 is Leu, Phe, Tyr, Trp, Val, Met, He, or Asn, with Leu most preferred; 
X 6 is Phe, Leu, Asp, Glu, Ala, He, Lys, Asn, Ser, Val, Trp, or Tyr, with Phe most 
20 preferred; 

X 7 is Lys, Phe, Asp, Gly, Leu, Asn, or Trp, with Lys most preferred; 

X 8 is Asn, Pro, Phe, Gly, Asp, Ala, Ser, Glu, Gin, or Trp, with Asn most preferred; 

Xg is Gin, or Lys, with Gin most preferred; 

X 10 is Trp; 

25 X„ is Phe, Thr, Met, Ser, Ala, Asn, Val, His, He, Pro, Trp, or Tyr, with Phe most 
preferred; 

X u is Asn, Asp, Glu, Pro, Gin, or Ser, with Asn and Asp most preferred; 

X 13 is Val, Leu, He, Pro, Ala, Gin, Ser, Met, Glu, Thr, Lys, or Trp, with Val and Leu most 

preferred; and 
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X 14 is Leu 5 Met, Val, Tyr, Ala, He, Trp, His, Pro, Gin, Glu, Phe, Lys, or Arg, with Leu 
most preferred. 

The polypeptides listed in Table 5 (infra) are preferred embodiments of the 
present invention. Polypeptides 304A-12-H12 (SEQ ID NO:59), 304A-14-B02 (SEQ ID 
5 NO:74), 304A-14-A12 (SEQ ID NO:83), and 304A-15-E04 (SEQ ID NO:92) are 
especially preferred embodiments of the present invention. 

Another aspect of the present invention relates to modifications of the foregoing 
polypeptides to provide CEA specific imaging agents, wherein the binding moieties are 
modified by radiolabeling, enzymatic labeling, or labeling with MR paramagnetic 
10 chelates; or wherein the binding moieties are incorporated in microparticles, ultrasound 
bubbles, microspheres, emulsions, or liposomes; or wherein the binding moieties are 
conjugated with optical dyes. 

In another aspect of the present invention, methods for isolating CEA binding 
moieties are provided. Such methods will be useful for isolating additional reagents for 
15 detection, localization, quantification, and treatment of neoplastic disorders associated 
with upregulated CEA expression. 

In another aspect of the invention, methods of diagnosing CEA-associated 
disorders and methods for localizing CEA-expressing cells or tissues, are provided, and 
methods for treating cancers indicated by increased CEA expression are provided. 
20 In another aspect of the invention, therapeutic agents comprising a combination, 

conjugation or fusion of a anticancer drug or other therapeutic agent with a CEA binding 
moiety according to the invention are provided. Such compositions will be useful in the 
treatment of CEA-associated disorders and conditions. 

In another aspect of the invention, recombinant bacteriophage displaying CEA 
25 binding polypeptides on their surfaces are also provided. Such phage are useful as 
screening reagents and reagents for detecting CEA. 

Another aspect of the invention relates to forming molecules containing multiple 
CEA-binding moieties to increase the residence time of these molecules on CEA targets. 
These multimeric molecules can be altered to provide CEA specific imaging agents by 
30 radiolabeling, enzymatic labeling, or labeling with MR paramagnetic chelates or 
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microparticles; ultrasound bubbles, microparticles, microspheres, emulsions, or 
liposomes; or optical dyes. 

Another aspect of the invention relates to introducing DNA that encodes one or 
more CEA-binding moieties into the coat protein of a virus to cause the virus to bind and 
preferentially infect CEA-bearing cells. Such alteration will make the virus target CEA- 
expressing (tumor) cells. 

These and other aspects of the present invention will become apparent with 
reference to the following detailed description. 

RRTFF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows results of a competition ELISA with CEA-binding phage isolated 
from the TN10/9 phage display library. The phage bearing CEA binding polypeptides are 
shown to compete for the same binding site (A3 domain) as the a-CEA(A3) chimeric 
antibody, cT84.66. 

Figure 2 shows ELISA scores for TNI 0/9 isolates A07 and G08, and Lib2 isolates 
304A-15-E04, 304A-12-H12, 304A-14-B02, and 304A-14-A12. In each group, the first 
bar is for wells each having 100 ng CEA. Successive bars are for 50 ng, 10 ng, 5 ng, 1 
ng, 0.5 ng, 0.1 ng and 0 CEA per well. 

Figure 3 shows ELISA scores for TNI 0/9 isolates A07 and G08, and Lib2 isolates 
304A-15-E04, 304A-12-H12, 304A-14-B02, and 304A-14-A12. The amount of CEA in 
each well is constant at 0.1 ng/well. A soluble peptide having the CEA binder sequence 
of the G08 isolate (SEQ ID NO: 4), DX-207, was added as an inhibitor to CEA binding. 
The first bar in each group represents 200 uM and successive bars represent 20 uM, 2 
uM, 200 uM, 20 uM, 2 uM, 0 M, and no target. 

DEFTNTTTONS 

In the following sections, the term "recombinant" is used to describe non-naturally 
altered or manipulated nucleic acids, host cells transfected with exogenous (non-native) 
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nucleic acids, or polypeptides expressed non-naturally, through manipulation of isolated 
DNA and transformation of host cells. Recombinant is a term that specifically encompasses 
DNA molecules which have been constructed in vitro using genetic engineering techniques, 
and use of the term "recombinant" as an adjective to describe a molecule, construct, vector, 
5 cell, polypeptide or polynucleotide specifically excludes naturally occurring such 
molecules, constructs, vectors, cells, polypeptides or polynucleotides. 

The term "bacteriophage" is defined as a bacterial virus containing a DNA core and 
a protective shell built up by the aggregation of a number of different protein molecules. 
The terms "bacteriophage" and "phage" are used herein interchangeably. 

10 The term "binding moiety" as used herein refers to any molecule, polypeptide, 

peptidomimetic or transformed cell ("transformant") capable of forming a binding complex 
with another molecule, polypeptide, peptidomimetic or cell. "CEA binding moiety" is a 
binding moiety that forms a complex with carcinoembryonic antigen (CEA) or a portion 
thereof, whether naturally expressed or synthetic or recombinant, soluble or membrane 

15 bound. Included among the portions of CEA specifically contemplated are the N- 

terminal domain (N), and intact domains Al, Bl, A2 5 B2, A3, or B3, or combinations of 
two or more such domains in a single conjugate or fusion protein. Particluar mention is 
made of the construct N-A3, which is a construct fusing the N-terminal domain of CEA 
with domain A3 of CEA. The A3 domain does not exhibit determinants having structural 

20 cognates in proteins known to be immunologically cross-reactive with CEA, and 

therefore binders to domain A3 may be capable of differentially binding to CEA and not 
those known structurally related antigens, such as, especially, NCA. Specific examples 
of CEA binding moieties are the polypeptides mentioned above (SEQ ID NOs: 1-9, 24-27, 
and 36-107), hybrid polypeptides incorporating such polypeptides, and recombinant cells or 

25 bacteriophage displaying any of such polypeptides. Also included within the definition of 
CEA binding moieties are polypeptides derived from a polypeptide having an amino acid 
sequence according to SEQ ID NO: 1, above, which have been modified for particular 
results (in addition to CEA or like polypeptide binding ability). Specific examples of 
modifications contemplated are C- or N-terminal amino acid substitutions or elongations, 

30 e.g., for the purpose of linking the binding moiety to a detectable imaging label or other 
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substrate. In addition to the detectable labels described further herein, other suitable 
substrates include anticancer drugs or other chemotherapeutic agents, enzymes, toxins, 
liposomes (e.g., loaded with a detectable label or chemotheraeutic agent), or a solid 
support, well, plate, bead, tube, slide, filter, or dish. Also specifically contemplated are 
5 substitutions of one or more cysteine residues that normally form disulfide links, for 

example substitution with non-naturally occurring amino acid residues having reactive side 
chains, for the purpose of forming a more stable bond between those amino acid positions 
than the former disulfide bond. All such modified CEA binding moieties are also 
considered CEA binding moieties so long as they retain the ability to bind CEA or a 
1 0 fragment or domain of CEA. 

The term "binding" refers to the determination by standard techniques that a 
binding moiety recognizes and binds reversibly to a given target. Such standard 
techniques include equilibrium dialysis, gel filtration, and the monitoring of 
spectroscopic changes that result from binding, e.g., using flourescence anisotropy, either 
1 5 by direct binding measurements or competition assays with another binder. 

The term "specificity" refers to a binding moiety having a higher binding affinity 
for one target over another. The term "CEA specificity" refers to a CEA binding moiety 
having a higher affinity for CEA as compared with another target, such as a serum protein 
(e.g., bovine serum albumin (BSA), human serum albumin (HSA)) or gelatin. 
20 The term "polypeptide" refers to a linear polymer of two or more amino acid 

residues linked with amide bonds, and the term "peptide" is used herein to refer to 
relatively short polypeptides, e.g., having fewer than about 30 amino acids. 

In the present application, a CEA binding moiety is said to "target" CEA- 
expressing cells if the binding moiety accumulates in or near the CEA-expressing cells or 
25 if the binding moiety is selectively taken up by the CEA-expressing cells or if the binding 
moiety is selectively taken up by and metabolized by the CEA-expressing cells. 
Substances that are not CEA binding moieties may be "targeted" to CEA-expressing cells 
by conjugation with CEA binding moieties of the present invention. 
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The term "cross-reactive" is used herein to describe binding associations between 
molecules akin to the binding of antibodies to antigens. It is to be understood to refer to 
non-covalent binding, not to the formation of covalent bonds. 

The term "detectably labeled" is to be understood as including linking a molecule 
5 to a dye (such as fluorescein), a radionuclide (such as 131 I), an enzyme (such as 

horseradish peroxidase), or detectable metal (such as a paramagnetic ion), which dye, 
radionuclide, enzyme or metal can thereafter be detected by appropriate means. The term 
"detectably labeled" also includes a binding moiety which has been synthesized to 
incorporate a radionuclide (such as 32 P, 35 S, or 14 C) in place of a non-radioactive isotope 
10 of the same element. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention provides novel binding moieties for CEA. Such binding 
moieties make possible the efficient detection, imaging, localization, and targeting of 

15 CEA or CEA-related polypeptides in tissues or in a solution or system that contains CEA 
or CEA-related polypeptides. In particular, the binding moieties of this invention, when 
appropriately labeled, are useful for detecting, imaging, localizing, and targeting CEA- 
expressing cells or for diagnosing CEA specific pathophysilogies. The CEA binding 
polypeptides disclosed herein can thus be used to form a variety of diagnostic and 

20 therapeutic agents for diagnosing and treating CEA associated diseases, such as colon 
cancer and other cancers characterized by overexpression of CEA in cells, as compared 
with levels of CEA expression in corresponding cells of normal individuals. The 
preferred binding moieties of the present invention bind CEA with high affinity, i.e., 
acting at low, physiologically relevant concentrations, comparable to known anti-CEA 

25 antibodies and other CEA-binding proteins. 

Preferred CEA binding polypeptides according to the invention will bind to CEA 
or a fragment thereof, but will not bind to other proteins that are known to be 
immunologically cross-reactive with CEA, such as NCA. 

Specific CEA binding polypeptides according to the present invention were 

30 isolated initially by screening of phage display libraries, that is, populations of 
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recombinant bacteriophage transformed to express an exogenous peptide loop on their 
surface. In order to isolate new polypeptide binding moieties for a particular target, such 
as CEA, screening of large peptide libraries, for example using phage display techniques, 
is especially advantageous, in that very large numbers (e.g., 5 x 10 9 ) of potential binders 
5 can be tested and successful binders isolated in a short period of time. 

In order to prepare a phage library of potential binding polypeptides to screen for 
members of the library that might be CEA binding peptides, a candidate binding domain 
is selected to serve as a structural template for the peptides to be displayed in the library. 
The phage library is made up of analogues of this template or "parental domain". The 
10 binding domain template may be a naturally occurring or synthetic protein, or a region or 
domain of a protein. The binding domain template may be selected based on knowledge 
of a known interaction between the binding domain template and CEA, but this is not 
critical. In fact, it is not essential that the domain selected to act as a template for the 
library have any affinity for the target at all: Its purpose is to provide a structure from 
1 5 which a multiplicity (library) of similarly structured polypeptides (analogues) can be 
generated, which multiplicity of analogues will hopefully include one or more analogues 
that exhibit the desired binding properties (and any other properties screened for). 

In selecting the parental binding domain or template on which to base the 
variegated amino acid sequences of the library, the most important consideration is how 
20 the variegated peptide domains will be presented to the target, i.e., in what conformation 
the peptide analogues will come into contact with the target. In phage display 
methodologies, for example, the analogues will be generated by insertion of synthetic 
DNA encoding the analogues into phage, resulting in display of the analogue on the 
surfaces of the phage. Such libraries of phage, such as M13 phage, displaying a wide 
25 variety of different polypeptides, can be prepared using techniques as described, e.g., in 
Kay et al., Phage Display of Peptides and Protein s: A Laboratory Manual (Academic 
Press, Inc., San Diego 1996) and U.S. 5,223,409 (Ladner et al.), both incorporated herein 
by reference. 

The phage libraries used in the present invention are constructed in derivatives of 
30 the filamentous phage Ml 3. The displayed peptides are fused to the amino terminus of 
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protein III through a linker peptide which contains the recognition site for Factor Xa. 
Factor Xa can cleave the displayed peptide from the phage without injuring the phage or 
reducing its infectivity. 

For formation of phage display libraries, it is preferred to use a structured 

5 polypeptide as the binding domain template, as opposed to an unstructured, linear 
peptide. Mutation of surface residues in a protein will usually have little effect on the 
overall structure or general properties (such as size, stability, and temperature of 
denaturation) of the protein; while at the same time mutation of surface residues may 
profoundly affect the binding properties of the protein. The more tightly a polypeptide 

10 segment is constrained, the less likely it is to bind to any particular target; however if the 
polypeptide does bind, the binding is likely to be of higher affinity and of greater 
specificity. Thus, it is preferred to select a parental domain and, in turn, a structure for 
the potential polypeptide binders, that is constrained within a framework having some 
degree of rigidity. In isolating the specific polypeptides according to this invention, four 

1 5 phage libraries were screen, each displaying a short, variegated exogenous peptide loop 
of 1 1, 12 or 16 amino acids on the surface of M13 phage, at the amino terminus of protein 
III. The libraries were designated TN6/6 (having a potential 3.3 x 10 12 amino acid 
sequence diversity), TN7/1 (having a potential 5.6 x 10 9 amino acid sequence diversity), 
TN8/6 (having a potential 6.3 x 10 9 amino acid sequence diversity), and TN10/9 (having 

20 a potential 3 x 10 16 amino acid sequence diversity). 

The TN6/6 library was constructed to display a single microprotein binding loop 
contained in a 12-amino acid template. The TN6/6 library utilized a template sequence of 
Xaa-Xaa-Xaa-Cys-Xaa-Xaa-Xaa-Xaa-Cys-Xaa-Xaa-Xaa (SEQIDNO: 10). Each 
amino acid position Xaa in the template was varied to permit any amino acid except 

25 cysteine (Cys). The number of potential designed sequences is 3.3 x 10 12 ; 2.0 x 10 8 
independent transformants were included in the library. 

The TN7/1 library was constructed to display a single microprotein binding loop 
contained in an 1 1-amino acid template. The TN7/1 library utilized a template sequence 
ofXaa-Xaa-Cys-Xaa-Xaa-Xaa-Xaa-Xaa-Cys-Xaa-Xaa (SEQIDNO: 11). The 

30 amino acids at the first and last positions in the template (amino acid positions 1 and 1 1) 
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were varied to permit any amino acid selected from a group of seven amino acids (i.e., 
Phe, His, Pro, Leu, Ala, Asp, or Arg); the amino acids at amino acid positions 2 and 10 in 
the template were varied to permit any amino acid selected from a group of nine amino 
acids (i.e., Leu, Gly, His, Ser, Asp, Arg, Pro, Ala, or Phe); the amino acids at amino acid 
positions 4, 5, 6, 7, and 8 (i.e., between the invariant cysteine residues in the template) 
were varied to permit any amino acid selected from a group of seventeen amino acids 
(i.e., Thr, He, Trp, Glu, Tyr, Gin, Asn, Val, Leu, Gly, His, Ser, Asp, Arg, Pro, Ala, or 
Phe). The number of potential designed sequences is 5.6 x 10 9 ; about 1.0 x 10 9 
independent transformants were included in the library. 

The TN8/6 library was constructed to display a single microprotein binding loop 
contained in an 12-amino acid template. The TN8/6 library utilized a template sequence 
ofXaa-Xaa-Cys-Xaa-Xaa-Xaa-Xaa-Xaa-Xaa-Cys-Xaa-Xaa (SEQIDNO: 12). The 
amino acids at the first and last positions in the template (amino acid positions 1 and 12) 
were varied to permit any amino acid selected from a group of four amino acids (i.e., Ala, 
Asp, Arg, or His); the amino acids at amino acid positions 2 and 1 1 in the template were 
varied to permit any amino acid selected from a group of nine amino acids (i.e., Pro, Ala, 
Phe, Ser, Asp, Arg, Leu, Gly, or His); the amino acids at amino acid positions 4, 5, 6, 7, 8 
and 9 (i.e., between the invariant cysteine residues in the template) were varied to permit 
any amino acid selected from a group of thirteen amino acids (i.e., Pro, Ala, Phe, Ser, 
Asp, Arg, Leu, Gly, His, Gin, Asn, Val, or Trp). The number of potential designed 
sequences is 2.2 x 10 15 ; about 1.0 * 10 9 independent transformants were included in the 
library. 

The TNI 0/9 library was constructed to display a single microprotein binding loop 
contained in an 16-amino acid template. The TNI 0/9 library utilized a template sequence 
Xaa~Xaa-Xaa-Cys-Xaa-Xaa-Xaa-Xaa-Xaa-Xaa-Xaa-Xaa-Cys-Xaa-Xaa-Xaa (SEQ 
ID NO: 13). The amino acids amino acid positions 1, 2, 15 and 16 of the template were 
varied to permit any amino acid selected from a group of ten amino acids (i.e., Tyr, Arg, 
Ser, Trp, Leu, Asn, Pro, Asp, Phe, or His); the amino acids at amino acid positions 3 and 
14 in the template were varied to permit any amino acid selected from a group of fourteen 
amino acids (i.e., Trp, Tyr, Arg, Ser, Val, Asn, Pro, Gin, Gly, His, Leu, Ala, Asp, or Phe); 
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the amino acids at amino acid positions 5, 6, 7, 8, 9, 10, 1 1 and 12 (i.e., between the 
invariant cysteine residues in the template) were varied to permit any amino acid except 
cysteine. The number of potential designed sequences is 3.0 x 10 16 ; and about 2.5 x 10 8 
independent transformants were included in the library. 
5 Such small binding loop peptides offer several advantages over large proteins: 

First, the mass per binding site is reduced, e.g., such highly stable and low molecular 
weight polypeptide domains can show much higher binding per gram than do antibodies 
(150 kDa) or single-chain antibodies (30 kDa). Second, the possibility of non-specific 
binding is reduced because there is less surface available. Third, small proteins or 
10 polypeptides can (because they are chemically synthesizable) be engineered to have 

unique tethering sites such as terminal polylysine segments in a way that is impracticable 
for larger proteins or antibodies. Fourthly, small peptides can be combined into homo- or 
hetero-multimers to give either hybrid binding or avidity effects. Fifthly, a constrained 
polypeptide structure is more likely to retain its functionality when transferred with the 
15 structural domain intact from one framework to another, that is, the binding domain 

structure is likely to be transferable from the framework used for presentation in a library 
(e.g., displayed on a phage) to an isolated protein removed from the presentation 
framework or immobilized on a chromatographic or other substrate. 

Each of the peptide loop libraries was created by making a designed series of 
20 mutations or variations within a coding sequence for the microprotein template, each 
mutant sequence encoding a binding loop analogue peptide corresponding in overall 
structure to the template except having one or more amino acid variations in the sequence 
of the template. The novel variegated (mutated) DNA provides sequence diversity, and 
each transformant phage displays one variant of the initial template amino acid sequence 
25 encoded by the DNA, leading to a phage population (library) displaying a vast number of 
different but structurally related amino acid sequences. The phage display libraries 
screened for CEA binders contained from 100 million to 1 billion variants of the 
respective parental domain peptides. The amino acid variations are expected to alter the 
binding properties of the binding loop or domain without significantly altering its 
30 structure, at least for most substitutions. It is preferred that the amino acid positions that 
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are selected for variation (variable amino acid positions) will be surface amino acid 
positions, that is, positions in the amino acid sequence of the domains which, when the 
domain is in its most stable conformation, appear on the outer surface of the domain (i.e., 
the surface exposed to solution). Most preferably the amino acid positions to be varied 

5 will be adjacent or close together, so as to maximize the effect of substitutions. 

As indicated previously, the techniques discussed in Kay et al, Phage Display of 
Peptides and Proteins: A Laboratory Manual (Academic Press, Inc., San Diego 1996) and 
U.S. 5,223,409 are particularly useful in preparing a library of potential binders 
corresponding to the selected parental template. The four libraries described above were 

10 prepared according to such techniques, and they was screened for CEA binding 

polypeptides against an immobilized CEA-relevant target (i.e., a synthetic fusion peptide 
target composed of a hexahistidine leader, the N domain of CEA and the A3 domain of 
CEA, designated "H6NA3"). 

In a typical screen, a phage library is contacted with and allowed to bind the 

15 target, in this case, CEA or a particular subcomponent(s), such as NA3, preferably 

presenting structures unique to CEA (i.e., structures not cross-reactive with NCA or other 
CEA-like antigens). The H6NA3 target was selected because the A3 domain was 
believed to be unique to CEA ; antibodies that bind A3 do not cross-react with other 
CEA-related antigens, such as NCA. 

20 To facilitate separation of binders and non-binders, it is convenient to immobilize 

the target on a solid support. When incubated in the presence of the target, phage bearing 
a target-binding moiety form a complex with the target on the solid support whereas non- 
binding phage remain in solution and may be washed away with buffer. Bound phage 
may then be liberated from the target by a number of means, such as changing the buffer 

25 to an extreme pH (pH 2 or pH 10), changing the ionic strength of the buffer, adding 
denaturants, or other known means. In the present case, CEA binders associating with 
immobilized target NA3 were eluted either by competition with a known CEA binding 
antibody (cT84.66, a chimeric mouse/human anti-A3 antibody supplied by The City of 
Hope, Duarte CA) or by cleavage with Factor Xa. 
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The recovered phage may then be amplified through infection of bacterial cells 
and the screening process repeated with the new pool that is now depleted in non-binders 
and enriched in binders. The recovery of even a few binding phage is sufficient to carry 
the process to completion. After a few rounds of selection, the gene sequences encoding 

5 the binding moieties derived from selected phage clones in the binding pool are 

determined by conventional methods, described below, revealing the peptide sequence 
that imparts binding affinity of the phage to the target. When the selection process 
works, the sequence diversity of the population falls with each round of selection until 
only good binders remain. The sequences converge on a small number of related binders, 

10 typically 10-50 out of the more than 100 million original candidates. An increase in the 
number of phage recovered at each round of selection, and of course, the recovery of 
closely related sequences are good indications that convergence of the library has 
occurred in a screen. After a set of binding polypeptides is identified, the sequence 
information may be used to design other secondary phage libraries, biased for members 

15 having additional desired properties. 

After analysis of the sequences isolated from the library screening, a family of 
particular CEA binders was defined. In addition, important consensus motifs were 
observed. The following sequences conforming to the TNI 0/9 template were found to 
bind a CEA target: 

20 Asnr-Tip-Val^ (SEQ ID 

NO:4)(TN10/9-G08); 

Asp-Trp-Val-Cys-Glu-Asn^ (SEQ ID 

NO:5)(TN10/9-A07); 

Asn-Tip-Asp-Cys-Met-Phe^ (SEQ ID 

25 NO:6)(TN10/9-E01); 

Asp-Trp-Val-Cys-Glu-Lys-T^^ (SEQ ID 

NO:7)(TN10/9-B09); 

Asn-Trp-Phe^ys-Glu-Met-Ile^ (SEQ ID 

NO:8)(TN10/9-Fll);and 
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Asp-Trp-Val^ys-Asn-Phe-^^ (SEQ ID 

NO:9)(TN10/9-G01). 

When displayed on the phage, these peptides are expected to form a disulfide 
bond between the Cys residues. In the synthetic peptides, the cysteines are preferably 
oxidized to form a disulfide. 

This series of CEA binders defines a family of polypeptides including the amino 

acid sequence: 

X^-Xs-^ (SEQ ID NO:l), 

wherein: 

Xj is Asn, Asp, or is absent; 
X 2 is Trp; 

X 3 is Asp, Phe or Val; 

X 4 is Asn, Glu or Met; 

X 5 is Asn, Leu, Met or Phe; 

X 6 is Asp, Gly, He, Lys, Phe or Thr; 

X 7 is Ala, Gin, Gly, Lys or Thr; 

X 8 is Arg, Asn, Asp, Glu or Gly; 

Xg is Gin, Gly, or Leu; 

X 10 is Ala, Trp or Tyr; 

X n is Ala, Gly, His, Phe, Thr or Val; 

X 12 is Asn, Gin, Phe, Ser or Val; 

X 13 is Arg, Leu, Pro or Ser; and 

X 14 is Leu, Ser, Trp or Tyr; 
and wherein said polypeptide has the ability to bind CEA. 

The cysteine residues of the microprotein are believed to form a disulfide bond, 
which causes the microprotein to form a stable loop structure under non-reducing 
conditions. Thus, the invention relates to the discovery of a CEA binding loop 
comprising a polypeptide having the amino acid sequence: Cys-X 4 -X 5 -X 6 -X 7 -X 8 -X 9 - 
X 10 -X n -Cys (SEQ ID NO: 3), 
wherein: 
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X 4 is Asn, Glu or Met; 
X 5 is Asn, Leu, Met or Phe; 
X 6 is Asp, Gly, He, Lys, Phe or Thr; 
X 7 is Ala, Gin, Gly, Lys or Thr; 
5 X 8 is Arg, Asn, Asp, Glu or Gly; 

Xg is Gin, Gly or Leu; 
X 10 is Ala, Trp or Tyr; and 
X n is Ala, Gly, His, Phe, Thr or Val. 

Recurrent sequences among isolates recovered in screens and recurrence of certain 
10 amino acids at the same positions within the isolate peptides gives rise to a preferred 
family of CEA binding peptides and a sequence that may be useful for designing a 
secondary, directed library for obtaining even higher affinity CEA binders. The preferred 
family of peptides have amino acid sequences of the formula: 
Xl -Trp~Val-Cys-Glu-X 5 -X 6 -Lys- X 8 -Gln-Trp- X n ~Cys-Asn-X 13 -X 14 (SEQ ID 
15 NO:2), wherein 

Xj is Asn or Asp; 
X 5 is Asn, Leu, Met or Phe; 
X 6 is Asp, Gly, He, Lys, Phe or Thr; 
X 8 is Arg, Asn, Asp, Glu, or Gly; 
20 X n is Ala, Gly, His, Phe, Thr or Val; 

X 13 is Arg, Leu, Pro or Ser; and 
X, 4 is Leu, Ser, Trp or Tyr. 

The invention also involves a library of phage focused on improved binders to 
CEA. This library was constructed from seven sublibraries, each sublibrary allowing five 
25 amino acid positions to vary while holding constant nine of the amino acid positions that 
were varied in the initial library (TN10/9). The template structure of the seven 
sublibraries allowed variegation in the forms: 
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Varl: X r X 2 -X 3 -Cys-X 4 -X 5 -Lys-Lys-Asp-Gln-Trp-Thr-Cys-Asn-Leu-Leu (SEQ ID 
N0:14) 

Var2: Asp-Trp-Val-Cys-X 4 -X 5 -X 6 -X 7 -X 8 -Gln-Trp-Thr-Cys-Asn-Leu-Leu(SEQ ID 
NO: 15) 

5 Var3: Asp-Trp-Val-Cys-Glu-Asn-Lys-Xy-Xs-Xg-X^-Xn-Cys-Asn-Leu-Le^SEQ ID 
NO: 16) 

Var4: Asp-Trp-Val-Cys-Glu-Asn-Lys-Lys-Asp-Gln-X 10 -X n -Cys-X 12 -X 13 -X 14 (SEQ ID 
NO: 17) 

Var5: Asp-Trp-Val-Cys-Glu-X 5 -X 6 -Lys-X 8 -Gln-Tip-X n -Cys-Asn-X 13 -Leu (SEQ ED 
10 NO:18) 

Var6: Asn-Trp-Val-Cys-X 4 -X 5 -X 6 -Lys-X 8 -Gln-Trp-X n -Cys-Asn-Ser-Tyr (SEQ ID 
NO: 19) 

Var7: X r Trp-X3-Cys-Asn-Leu-Phe-Lys-Asn-Gln-Trp-Phe-Cys-X 12 -X 13 -X 14 (SEQ ID 
NO:20), 

15 wherein each X residue was allowed to be, with approximately equal likelihood, each of 
the genetically encodable amino acids except cysteine. The invariant portions of 
sequences Varl through Var5 were based on the most frequently observed CEA binder 
sequence in the initial screening of TN10/9 (see, SEQ ID NO:5, isolate TN10/9-A07). 
The invariant portions of sequences Var6 through Var7 were based on the binding 

20 peptide observed to have the best dissociation constant (see, SEQ ID NO:4, isolate 
TN10/9-G08). 

Additional, preferred CEA binding peptides were isolated from this focused 
library having the sequences as shown in Table 5 (infra). 

Direct synthesis of the peptides of the invention disclosed herein may be 
25 accomplished using conventional techniques including, preferably, solid-phase peptide 
synthesis, although solution-phase synthesis may also be used. In solid-phase synthesis, 
for example, the synthesis is commenced from the carboxy-terminal end of the peptide 
using an a-amino protected amino acid. t-Butyloxycarbonyl (Boc) protective groups can 
be used for all amino groups, though other protective groups are suitable. See, Stewart et 
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al., Solid-Phase Peptide Synthesis (1989), W. H. Freeman Co., San Francisco; and 
Merrifield, J. Am. Chem. Soc, 85:2149-2154 (1963). 

Polypeptides according to the invention may also be prepared commercially by 
companies providing peptide synthesis as a service (e.g., BACHEM Bioscience, Inc., 

5 King of Prussia, PA; Quality Controlled Biochemicals, Inc., Hopkinton, MA). 

Automated peptide synthesis machines, such as manufactured by Perkin-Elmer 
Applied Biosystems, also are available. 

The polypeptide compound is preferably purified once it has been isolated or 
synthesized by either chemical or recombinant techniques. For purification purposes, 

10 there are many standard methods that may be employed including reversed-phase 

high-pressure liquid chromatography (HPLC) using an alkylated silica column such as 
C 4 -, C 8 - or C 18 -silica. A gradient mobile phase of increasing organic content is generally 
used to achieve purification, for example, acetonitrile in an aqueous buffer, usually 
containing a small amount of trifluoroacetic acid. Ion-exchange chromatography can also 

15 be used to separate peptides based on their charge. The degree of purity of the 

polypeptide may be determined by various methods, including identification of a major 
large peak on HPLC. A polypeptide that produces a single peak that is at least 95% of 
the input material on an HPLC column is preferred. Even more preferable is a 
polypeptide that produces a single peak that is at least 97%, at least 98%, at least 99% or 

20 even 99.5% of the input material on an HPLC column. 

In order to ensure that the peptide obtained using any of the techniques described 
above is the desired peptide for use in compositions of the present invention, analysis of 
the peptide composition may be carried out. Such composition analysis may be 
conducted using high resolution mass spectrometry to determine the molecular weight of 

25 the peptide. Alternatively, the amino acid content of the peptide can be confirmed by 
hydrolyzing the peptide in aqueous acid, and separating, identifying and quantifying the 
components of the mixture using HPLC, or an amino acid analyzer. Protein sequenators, 
which sequentially degrade the peptide and identify the amino acids in order, may also be 
used to determine definitely the sequence of the peptide. 
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The new class of CEA binding polypeptides is designed to be conformationally 
restrained by disulfide linkages between the two cysteine residues in their sequence. This 
conformational restraint ensures that the peptides have a stable binding structure that 
contributes to the peptides' affinity for CEA and their specificity for CEA over non-CEA 
proteins. Other methods for constraining peptides which would retain a similar 
conformation and CEA specificity for the peptide have been described in the art and are 
contemplated herein, including the substitution of one or more of the cysteine residues 
with non-naturally occurring amino acids or peptidomimetics for the purpose of forming 
a more stable or conformationally preferred linkage between the two positions on the 
peptide. All such modified CEA binding moieties are also considered CEA binding 
moieties so long as they retain the ability to bind CEA or a portion thereof. Non- 
cyclized, or linear, versions of the peptides may also retain moderate binding ability and 
specificity for CEA and could also be employed in the present invention. 

Homologues of the CEA binding polypeptides described herein, as well as 
homologues to any subsequently discovered CEA binding polypeptides, may be formed 
by substitution, addition or deletion of one or more amino acids employing methods well 
known in the art and for particular purposes known in the art, such as addition of a 
polyhistidine "tail" in order to assist in purification or substitution of one up to several 
amino acids in order to obliterate an enzyme cleavage site. Other specifically 
contemplated homologues include polypeptides having N-terminal or C-terminal 
modifications or linkers, such as polyglycine or polylysine segments, and alterations to 
include functional groups, notably hydrazide (-NH-NH 2 ) functionalities, to assist in 
immobilization of binding peptides according to this invention on solid supports. 

Such homologous polypeptides will be understood to fall within the scope of the 
present invention so long as the substitution, addition or deletion of amino acids does not 
eliminate its ability to bind CEA. 

The term "homologous", as used herein, refers to the degree of sequence 
similarity between two polymers (i.e., polypeptide molecules or nucleic acid molecules). 
When the same nucleotide or amino acid residue occupies a sequence position in the two 
polymers under comparison, then the polymers are homologous at that position. The 
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percent homology between two polymers is the mathematical relationship of the number 
of homologous positions shared by the two polymers divided by the total number of 
positions compared, the product multiplied by 100. For example, if the amino acid 
residues at 60 of 100 amino acid positions in two polypeptide sequences match or are 
homologous then the two sequences are 60% homologous. The homology percentage 
figures referred to herein reflect the maximal homology possible between the two 
polymers, i.e., the percent homology when the two polymers are so aligned as to have the 
greatest number of matched (homologous) positions. Polypeptide homologues within the 
scope of the present invention will be at least 85% and preferably greater than 90% 
homologous to at least one of the CEA binding sequences disclosed herein. 

CEA binding polypeptides according to the present invention also may be 
produced using recombinant DNA techniques, utilizing nucleic acids (polynucleotides) 
encoding the polypeptides according to this invention and then expressing them 
recombinantly, i.e., by manipulating host cells by introduction of exogenous nucleic acid 
molecules in known ways to cause such host cells to produce the desired CEA binding 
polypeptides. Recombinant production of short peptides (e.g., 16-mers) such as those 
described herein may not be advantageous in comparison to direct synthesis, however 
recombinant means of production may be very advantageous where a CEA binding motif 
of this invention are desired to be incorporated in a hybrid polypeptide or fusion protein. 

The polynucleotides of the present invention may be in the form of RNA or in the 
form of DNA, which DNA includes cDNA and synthetic DNA. The DNA may be 
double-stranded or single-stranded, and if single stranded may be the coding strand or 
non-coding (anti-sense) strand. The coding sequences for CEA binding polypeptides 
according to the present invention may be manipulated or varied in known ways to yield 
alternative coding sequences that, as a result of the redundancy or degeneracy of the 
genetic code, encode the same polypeptide. 

The polypeptides and polynucleotides of the present invention are preferably 
provided in an isolated form, and preferably are purified to homogeneity. 

Where recombinant production of CEA binding polypeptides is desired, the 
present invention also contemplates vectors that include polynucleotides of the present 
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invention, host cells that are genetically engineered with vectors of the invention, and 
recombinant polypeptides produced by culturing such genetically engineered host cells. 
Host cells are genetically engineered (transduced or transformed or transfected) with the 
vectors of this invention which may be, for example, a cloning vector or an expression 

5 vector. The vector may be, for example, in the form of a plasmid, a viral particle, a 
phage, etc. The engineered host cells can be cultured in conventional nutrient media 
modified as appropriate for activating promoters, selecting transformants or amplifying 
the CEA binder-encoding polynucleotides. The culture conditions, such as temperature, 
pH and the like, are those suitable for use with the host cell selected for expression and 

10 will be apparent to the skilled practitioner in this field. The polynucleotide may be 

included in any one of a variety of expression vectors for expressing a polypeptide. Such 
vectors include chromosomal, nonchromosomal and synthetic DNA sequences, e.g., 
derivatives of SV40; bacterial plasmids; phage DNA; baculovirus; yeast plasmids; 
vectors derived from combinations of plasmids and phage DNA, viral DNA such as 

15 vaccinia, adenovirus, fowl pox virus, and pseudorabies. However, any other vector may 
be used as long as it is replicable and viable in the host. The appropriate DNA sequence 
may be inserted into the vector by a variety of procedures. In general, the DNA sequence 
is inserted into an appropriate restriction endonuclease site(s) by procedures known in the 
art. Such procedures and others are within the capability of those skilled in the art. 

20 The DNA sequence in the expression vector is operatively linked to an 

appropriate expression control sequence(s) (promoter) to direct mRNA synthesis. As 
representative examples of such promoters, there may be mentioned LTR or SV40 
promoter, the E. coll lac or tip, the phage lambda P L promoter and other promoters 
known to control expression of genes in prokaryotic or eukaryotic cells or their viruses. 

25 The expression vector also contains a ribosome binding site for translation initiation and a 
transcription terminator. The vector may also include appropriate sequences for 
amplifying expression. In addition, expression vectors preferably will contain one or 
more selectable marker genes to provide a phenotypic trait for selection of transformed 
host cells, such as dihydrofolate reductase or neomycin resistance for eukaryotic cell 
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culture, or such as tetracycline or ampicillin resistance for bacterial cell cultures such as 
E. coli. 

The vector containing the appropriate DNA sequence as hereinabove described, as 
well as an appropriate promoter or control sequence, may be employed to transform an 

5 appropriate host to permit the host to express the protein. As representative examples of 
appropriate host cells, there may be mentioned bacterial cells, such as E. coli, 
Streptomyces, Salmonella typhimurium; fungal cells, such as yeast; insect cells such as 
Drosophila and Sf9; animal cells such as CHO, COS or Bowes melanoma; plant cells, 
etc. The selection of an appropriate host for this type of CEA binder production is also 

1 o within the capability of those skilled in the art from the teachings herein. Many suitable 
vectors and promoters useful in expression of proteins according to this invention are 
known to those of skill in the art, and many are commercially available. The following 
vectors are provided by way of example. Bacterial: pQE70, pQE60, pQE-9 (Qiagen), 
pBS(+ or -), pDIO, pHagescript, psiX174, pBluescript SK, pBSKS, pNH8A, pNH16a, 

15 pNHl 8A, pNH46A (Stratagene); ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 
(Pharmacia). Eukaryotic: pWLNEO, pSV2CAT, pOG44, pXTl, pSG (Stratagene) 
pSVK3, pBPV, pMSG, pSVL (Pharmacia). Any other plasmid or vector may be used as 
long as it is replicable and viable in the selected host cell. 

Introduction of the vectors into the host cell can be effected by any known 

20 method, including calcium phosphate transfection, DEAE-Dextran mediated transfection, 
or electroporation (see Davis et al., Basic Methods in Molecular Biology (1986); 
Sambrook et al., Molecular Cloning . ISBN 0-87969-309-6, (1987)). 

In the practice of the present invention, a determination of the affinity of the CEA 
binding moiety for CEA relative to other components of a sample is a useful measure, 

25 and is referred to as specificity for CEA. Standard assays for quantitating binding and 
determining affinity include equilibrium dialysis, equilibrium binding, gel filtration, 
surface plasmon resonance, micro-balances (Hengerer et al., Biotechniques, 26(5):956-60, 
962, 964 (1999)) or the monitoring of numerous spectroscopic changes (such as 
fluorescence) that may result from the interaction of the binding moiety and its target. 

30 These techniques measure the concentration of bound and free ligand as a function of 
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ligand (or protein) concentration. The concentration of bound polypeptide ([Bound]) is 
related to the concentration of free polypeptide ([Free]) and the concentration of binding 
sites for the polypeptide, i.e., on CEA, (N = total CEA), as described in the following 
equation: 

5 [Bound] - N x [Free]/((K D )+[Free]). 

A solution of the data to this equation yields the dissociation constant, K D , a 
quantitative measure of the binding affinity. The association constant, is the 
reciprocal of the dissociation constant, K D . A peptide having a K D 2 times higher for HSA 
(or some other non-CEA target such as NCA) than for CEA would be considered as a 

10 weak CEA binder. A peptide having a K D 10 times greater for HSA than CEA would be a 
moderate CEA binder, and a peptide having a K D 100 times or more greater for HSA than 
for CEA would be termed highly specific for CEA. Preferably the peptides and agents of 
the present invention have a K D at least 2 times higher for HSA than for CEA, more 
preferably at least 10 times higher, even more preferably at least 100 times, and most 

1 5 preferably at least 1 000 times higher. 

For most uses, the lower the dissociation constant, the better. Preferred CEA 
binders according to the invention will have a for CEA of less than 10 pM, more 
preferred CEA binders will have a for CEA of less than 1 jaM, and most preferred 
CEA binders will have a for CEA less than 0.1 |jM or lower. The first set of CEA 

20 binders isolated from the TN10/9 library had a for CEA in the range of 1 jiM to 7 \xM. 

Uses for CEA Binding Polypeptides 

The CEA binding moieties according to this invention will be useful for detecting 
the presence of CEA in blood or other biological fluids and/or for localizing or imaging 
25 of CEA expression in vitro or in vivo, and particularly for detection and/or imaging of 

CEA-expressing cells and tissues. Any suitable method of assaying or imaging CEA may 
be employed. 
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Detection of CEA 

Assays for CEA using the CEA binding moieties of the present invention can be 
direct binding, competitive binding, or sandwich assays. The CEA binding moiety can be 
attached to a surface and used in a surface-plasmon resonance or microbalance to detect 
the binding of CEA directly. 

Alternatively, bacteriophage that display a CEA binding peptide can be incubated 
with cells to be tested for CEA expression. The cells can be spun down in a manner that 
does not sediment the phage, and the presence of phage in the cell pellet can then be 
detected with labeled antibodies that bind to the phage. Another useful detection assay 
utilizes detectably labeled CEA binding moiety, which is mixed with cells to be tested for 
surface expression of CEA. After incubation, the cells are spun down and the presence 
(or absence) of CEA is detected by the presence (or absence) of the label in the cell pellet. 

In a further detection method, a CEA binding moiety can be immobilized, a 
sample to be tested is contacted with the immobilized CEA binding moiety, and after 
incubation, the sample is removed and the container washed. The presence of CEA is 
detected with a detectably labeled antibody that binds CEA. This antibody need not 
distinguish between CEA and other cross-reactive antigens where preferred polypeptides 
according to this invention are used, because only CEA will have been captured by the 
immobilized binding moiety (which, in preferred features, is specific for CEA and not for 
cross-reactive species such as NCA). 

In another method, a CEA binding moiety of the present invention is immobilized 
in a well; detectably labeled CEA is bound to the CEA binding moiety; a sample is then 
added, and the presence or absence of CEA is detected by the release of the labeled CEA 
or the retention of the labeled CEA. 

For detection or purification of CEA or CEA-expressing cells in or from a 
solution, a binding moiety of the invention can be immobilized on a solid substrate such 
as a chromatographic support or other porous material, then the immobilized binding 
moiety can be loaded or contacted with the solution under conditions suitable for 
formation of a binding moiety/CEA complex. The non-binding portion of the solution 
can be removed and the complex may be detected, e.g., using an anti-CEA or anti-binding 



27 



moiety antibody, or the CEA target may be released from the binding moiety at 
appropriate elution conditions. 

Tumor Imaging 

5 A particularly preferred use for the polypeptides according to the present 

invention is for creating visually readable images of tumors including neoplastic cells 
expressing high levels of CEA, to aid in the diagnosis, monitoring and treatment of CEA 
associated cancers or other disorders. 

The CEA binding moieties disclosed herein may be converted to imaging reagents 

10 for detecting CEA-expressing tumors by conjugating the polypeptides with a label 

appropriate for diagnostic detection. Preferably, a CEA binding polypeptide exhibiting 
much greater specificity for CEA than for NCA is used, A polypeptide according to this 
invention may be conjugated or linked to a label appropriate for the detection 
methodology to be employed. For example, the CEA binder may be conjugated with a 

15 paramagnetic chelate suitable for magnetic resonance imaging (MRI), with a radiolabel 
suitable for x-ray imaging, with an ultrasound microsphere or liposome suitable for 
ultrasound detection, or with an optical imaging dye. 

Suitable linkers for conjugating the polypeptide binder to a detectable label can be 
substituted or unsubstituted alkyl chains, amino acid chains (e.g., polyglycine), 

20 polyethylene glycols, polyamides, and other simple polymeric linkers known in the art. 
Many heterobifunctional linkers are also known and are commercially available. 
Detectable labels may also be bound directly to the CEA binding moieties, e.g., at a 
lysine side chain or other reactive site that does not interfere with CEA/binding moiety 
interaction. 

25 Molecules that contain multiple copies of a CEA-binding moiety are likely to 

have longer residence times both at the tumor and in circulation. It is desirable to have an 
agent that is intended to bind CEA for either imaging or therapy to remain in circulation 
for at least a few hours so that it has time to reach the tumor. Once the agent has reached 
the tumor, it is desirable that it stay there as long as possible. 
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Many techniques are known for preparing multimeric forms of binding molecules, 
and such techniques may be used to prepared CEA binding multimers having increased 
serum half life and higher avidity for CEA. For example, one of the CEA binding 
peptides of the present invention can be synthesized with a C-terminal extention of Gly- 

5 Gly-Lys. The side groups of the all other residues are protected in one way and the 

terminal Lys side group and carboxyl groups are protected in an orthogonal manner. The 
terminal Lys amine group is deprotected and a chelator group is attached. The carboxyl 
groups is deprotected and two copies are joined using Afunctional polyethylene glycol 
reagents. The other protecting groups are removed. Just before use, a suitable 

10 radionuclide is added. For imaging 99M Tc is a preferred radionuclide and HYNIC is a 
preferred chelator. Alternatively, moieties other than chelators could be coupled to the 
lysine extention. Whole antibodies show detectable avidity effects when binding to CEA. 
Thus comparable avidity effects are expected in a dimerized binding peptide according to 
the invention, if the peptide moieties of one molecule can be separated by a distance 

15 similar to the separation between the combining sites of an antibody (i.e., on the order of 
100 A. A linker containing -80 units of -(CH 2 -CH 2 -0)~ should allow sufficient 
separation and increase the serum residence time. 

In general, the technique of using a detectably labeled CEA binding moiety in 
vivo for diagnosis is based on the premise that the label generates a signal that is 

20 detectable outside the patient's body. When the detectably labeled CEA binding moiety 
is administered to the patient suspected of having a CEA-expressing tumor, the high 
affinity of the CEA binding moiety for CEA on a tumor causes the CEA binding moiety 
to bind to the tumor and accumulate label at the site of the tumor. Sufficient time is 
allowed for the labeled peptide to localize at the site of the tumor. The signal generated 

25 by the labeled peptide may then be detected by a scanning device which will vary 
according to the type of label used, and the signal is then converted to an image of the 
tumor. 

Therapeu tic Applications 

The CEA binding moieties of the present invention can be used to improve the 
30 activity of anti-cancer drugs or tumor-killing agents by providing or improving their 
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affinity for CEA. In this aspect of the invention, hybrid anti-tumor agents are provided 
by conjugating a CEA binding moiety according to the invention with a drug or other 
agent lethal to the tumor. The CEA binding moiety component of the conjugate causes 
the anti-tumor agent to target the sites of CEA-expressing cells, and to improve the 
affinity of the conjugate for the cells, so that the anti-tumor activity of the conjugate is 
more localized and concentrated at the sites of tumors. Such conjugates will be useful in 
treating CEA associated diseases, especially colon cancer, in humans and animals, which 
method comprises administering to a human or animal in need thereof an effective 
amount of a CEA binding moiety according to the invention conjugated with an 
appropriate therapeutic agent. The invention also provides the use of such conjugates in 
the manufacture of a medicament for the treatment of diseases associated with the 
overexpression of CEA by cells in humans and animals. 

A CEA binding moiety of the present invention may be advantageously used to 
target a toxin, radioactivity, cytolytic T cells, cytokines, chemotherapeutic agents or other 
molecules to a tumor expressing CEA. 

In the above treatment method, the compounds may be administered by any 
convenient route customary for anti-tumor treatments, for example by infusion or bolus 
injection. In a preferred mbodiment, the composition may be formulated in accordance 
with routine procedures as a pharmaceutical composition adapted for intravenous 
administration to human beings. Typically, compositions for intravenous administration 
are solutions in sterile isotonic aqueous buffer. Where necessary, the composition may 
also include a solubilizing agent and a local anaesthetic such as lignocaine to ease pain at 
the site of the injection. Generally, the ingredients will be supplied either separately or 
mixed together in unit dosage form, for example, as a dry lyophilised powder or water 
free concentrate in a hermetically sealed container such as an ampoule or sachette 
indicating the quantity of active agent in activity units. Where the composition is to be 
administered by infusion, it can be dispensed with an infusion bottle containing sterile 
pharmaceutical grade "water for injection" or saline. Where the composition is to be 
administered by injection, an ampoule of sterile water for injection or saline may be 
provided so that the ingredients may be mixed prior to administration. 
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The quantity of material administered will depend on the seriousness of the 
condition and position and size of the tumor. The precise dose to be employed and mode 
of administration must per force in view of the nature of the complaint be decided 
according to the circumstances by the physician supervising treatment. In general, 
5 dosages of the CEA binder/anti-tumor agent conjugate will follow the dosages that are 
routine for the anti-tumor agent alone, although the improved affinity for CEA added by 
the CEA binder component may allow a decrease in the standard dosage. 

Isolation of CEA binding moieties in accordance with this invention will be 
further illustrated in the following examples. The specific parameters included in the 
10 following examples are intended to illustrate the practice of the invention, and they are 
not presented to in any way limit the scope of the invention. 

Example 1 : Preparation of a CEA Target for Library Screening 

For screening libraries to isolate binding moieties for CEA, a truncated target 

15 protein, i.e., H6NA3, consisting of a hexahistidine leader and the N and A3 domains of 
CEA, was used, based on a presumption that binders directed at the A3 domain would not 
be cross-reactive with other antigens, such as NCA, having a high degree of homology to 
CEA. The recombinantly produced H6NA3 protein was dispersed in PBS and added to 
the wells of a 96-well polystyrene microtiter plate, at 1 |ig/well. The plate was allowed to 

20 stand overnight at 4°C, which was effective to immobilize target H6NA3 antigen on the 
plate. 

Example 2: Screening of Phage Display Libraries 

Four phage display libraries were used in the initial screening for CEA binding 
25 moieties. The libraries were designated TN6/6, TN7/1, TN8/6 and TN10/9. 

The TN6/6 phage display library was composed of recombinant Ml 3 phage 
displaying variegated exogenous single-loop peptides based on a microprotein template 
having the structure Xaa-Xaa-Xaa-Cys-Xaa-Xaa-Xaa-Xaa-Cys-Xaa-Xaa-Xaa (SEQ 
ID NO: 10) and providing 2.0 x 10 8 peptide diversity. 
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The TN7/1 phage display library was composed of recombinant Ml 3 phage 
displaying variegated exogenous single-loop peptides based on a microprotein template 
having the structure Xaa-Xaa-Cys-Xaa-Xaa-Xaa-Xaa-Xaa-Cys-Xaa-Xaa (SEQ ID 
NO: 11) and providing 1 x 10 9 peptide diversity. 
5 The TN8/6 phage display library was composed of recombinant Ml 3 phage 

displaying variegated exogenous single-loop peptides based on a microprotein template 
having the structure Xaa-Xaa-Cys-Xaa-Xaa-Xaa-Xaa-Xaa-Xaa-Cys-Xaa-Xaa (SEQ 
ED NO: 12) and providing about 1.0 x 10 9 peptide diversity 

The TNI 0/9 phage display library was composed of recombinant Ml 3 phage 
10 displaying variegated exogenous single-loop peptides based on a microprotein template 
having the structure Xaa-Xaa-Xaa-Cys-Xaa-Xaa™Xaa-Xaa-Xaa-Xaa~Xaa-Xaa-Cys- 
Xaa-Xaa-Xaa (SEQ ID NO: 13) and providing 2.5 x 10 s peptide diversity. 

All of the libraries were constructed so that the phage expressed a variegated 
peptide at the amino terminus of protein III, and a constant Factor Xa cleavage site was 
15 provided between the diplay peptide and mature protein III. Each library was separately 
diluted into 100 \iL of binding buffer (50 mM Tris, 150 mM NaCl, 2 mM CaCl 2 , 0.05% 
Tween-20) before addition to H6NA3-coated wells. 

Each library was allowed to interact with the target separately. After a 2-hour 
incubation with the target to allow binding, the wells of the plate were washed 
20 extensively (15 times) to remove unbound or weakly bound phage. Bound phage were 
recovered by eluting the phage by two methods: first, a competing ligand for CEA, 
namely, a chimeric mouse/human anti-A3 monoclonal antibody, cT84.66 (added at 333 
nM), and second, by Factor Xa cleavage. The eluted phage were recovered and 
propagated overnight. 

25 The amplified phage recovered were concentrated, re-exposed to the target and 

eluted with either cT84.66 antibody or Factor Xa, and this procedure was repeated two 
more times. A progressive increase in the elution titer following each of the four rounds 
of screening indicated selection of phage having affinity for the NA3 target. 
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F.vam ple 3: Analysis of Indiv idual Isolates 

After four rounds of selection, the eluted phage were propagated and a portion 
plated to isolate phage plaques arising from individual clones. Ninety-four such clones 
were selected randomly, propagated, and tested individually for binding to NA3 in a dried 

5 H6NA3 plate ELISA. Dried H6NA3 plates were prepared as described above for the 
library screening. Phage samples (~10 9 phage each) were incubated in the H6NA3 plate 
wells in binding buffer (50 mM Tris, 150 mM NaCl, 2 mM CaCl 2 , 0.05% Tween-20) 
containing 0.1% HSA. After 1 hour, the plates were washed 5 times with binding buffer. 
Polyclonal anti-M13 antibody conjugated to horseradish peroxidase (Pharmacia) was 

10 added at 1/5000 dilution in binding buffer to the wells and incubated for 1 hour. The 
wells were again washed 5 times with binding buffer and the presence of the 
antibody/phage/NA3 complex was measured with HRP calorimetric reagents (3,3\5,5'- 
tetramethylbenzidine (TMB) and H 2 0 2 ). A high absorbance at 630 nm (due to oxidized 
TMB) was indicative of a tight phage/NA3 interaction, and phage clones corresponding 

15 to those wells were identified as bearing CEA-binding moieties. 

ELISAs to assay binding to immobilized HSA (passively bound to the 
polystyrene plate) and a target-free microtiter plate were controls to eliminate phage that 
bound promiscuously or nonspecifically. 

The amino acid sequences of the phage-displayed polypeptides from the ELISA 

20 positive clones (those positive for NA3 but negative for HSA and the polystyrene plate) 
were deduced by DNA sequencing. The amino acid sequence data from these phage 
isolates were sorted according to the degree of similarity and response in the H6NA3 
plate ELISA. The results of the screen from the TN10/9 library are set forth in Table 1. 
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Table 1: 


Amino acid sequences of CEA-binding polypeptides from the TN10/9 li 


jrary 


TN10/9 
isolate 


sequence 


competition 
elution (AbE) 


cleavage 
elution (FXE) 


overall 
fraction (94) 


SEQ 
ID NO: 


G08 


NWVCNLFKNQWFCNSY 


0/46 (0.00) 


6/48 (0.125) 


6/94 (0.064) 


4 


AO 7 


DWVCENKKDQWTCNLL 


42/46 (0.91) 


33/48 (0.69) 


75/94 (0.80) 


5 


E01 


NWDCMFGAEGWACSPW 


2/46(0.04) 


0/48 (0.00) 


2/94 (0.043) 


0 


B09 


DWVCELTTGGYVCQPL 


1/46 (0.022) 


0/48 (0.00) 


1/94 (0.011) 


7 


Fll 


NWFCEMIGRQWGCVPS 


0/46 (0.00) 


4/48 (0.083) 


4/94 (0.043) 


8 


D04 


DWVCNFDQGLAHCFPS 


0/46 (0.00) 


1/48 (0.021) 


1/94(0.011) 


9 


G01 


NWRCKLFPRYPYCSSW 


0/46 (0.00) 


1/48 (0.021) 


1/94 (0.011) 


21 


BIO 


-RYCEFFPWSLHCGRP 


1/46 (0.022) 


3/48 (0.063) 


4/94 (0.043) 


22 



The screens of the TN6/6, TN7/1 and TN8/6 libraries did not result in recovery of 
any high affinity CEA binders. Peptides G01 (SEQ ID NO:21) and B10 (SEQ ID NO:22) 
5 were later found not to bind CEA with useful affinity. 



Fygtri ptfl A: Sequence Conservati on Amonp TN1 0/9 Isolates 

The polypeptide sequences binding to the H6NA3 target define a cysteine- 
bracketed CEA binding loop often amino acids (including the cysteines), viz., Cys-X 4 - 
10 X 5 -X 6 -X-X 8 -X 9 -X 10 -X 1I -Cys (SEQ ID NO: 3), wherein 
X 4 is Asn, Glu or Met; 
X 5 is Asn, Leu, Met or Phe; 
X 6 is Asp, Gly, He, Lys, Phe or Thr; 
X 7 is Ala, Gin, Gly, Lys or Thr; 
15 X 8 is Arg, Asn, Asp, Glu or Gly; 

X, is Gin, Gly, Leu or Ser; 
X I0 is Ala, Trp or Tyr; and 
X u is Ala, Gly, His, Phe, Thr or Val, 
which forms a stable binding site for CEA. 
20 It is also clear from the selected isolates that one particular sequence, that of A07 

(SEQ ID NO:5), recurs with high frequency (75/94) and was recovered by both elution 
methods: competition with cT84.66 (AbE) and Factor Xa cleavage (FXE). Because this 
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polypeptide occurred with such high frequency among the isolates, it was regarded as a 
preferential binder, and the other sequences were compared against the A07 sequence to 
determine whether any amino acid positions in the 16-mer were conserved. A position 
was considered conserved if one of the other isolates exhibited the same amino acid at the 
5 same position relative to the invariant cysteine residues; a position was considered highly 
conserved if two or more of the other isolates exhibited the same amino acid at the same 
position relative to the invariant cysteine residues. From this analysis, the following 
conserved sequence (SEQ ED NO: 23) was observed: 

AA position: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 
conserved: DWVCEXXKXQWXfiNXL 

In this conserved sequence (SEQ ID NO: 23), the positions designated "X" showed no 
10 conservation of an A07 amino acid, the specified amino acid residues (at positions 8, 14 
and 16) were conserved, and the underscored amino acid residues were highly conserved. 
The conserved sequence was used as a parental template in the design of an additional, 
secondary library (Lib2) to be screened for additional high affinity binders of CEA. 

15 Example 5: Epitope Mapping of Phage Isolates 

Several of the phage isolates were tested to determine whether binding occurred in 
the CEA A3 domain, by performing an ELIS A with different concentrations of the anti- 
A3 chimeric antibody, cT84.66. Isolates G08, A07, E01, B09, Fl 1 and D04 were tested. 
In each assay, H6NA3 (100 ng/well) was coated on microtiter plate wells, blocked with 

20 2% non-fat dry milk (Carnation), and peptide displaying phage were added (5 x 

10 10 /well) in the presence or absence of cT84.66. After washing, binding to H6NA3 was 
detected using horseradish peroxidase-labeled anti-M13 antibody. Where cT84.66 was 
used, it was added at 10 nM or 100 nM in separate trials. Addition of BSA was 
performed as a control; and assays using non-CEA-binding phage B10 (SEQ ID NO:22) 

25 and G01 (SEQ ID NO:21) were also performed as negative controls. The results of the 
mapping ELIS A are shown in Fig. 1 . These results show that in each case, the previously 
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isolated CEA binding phage competed for the same binding site with the antibody 
cT84.66, which recognizes the A3 domain of CEA. 

Example 6: Binding Studies 
5 The affinity of the peptides displayed by the four highest affinity phage isolates 

(Fig. 1) were further tested in direct binding studies, 27-mer peptides including the 
binding loops of the G08, A07, E01 and B09 isolates were synthesized by solid-phase 
synthesis. The peptides thus prepared are shown in Table 2. 



Table 2: 


Amino acid sequences of CEA-binding polypeptides for binding studies 


Peptide 


Synthesized Polypeptide 


SEQ 
ID NO: 




112 2 
1 5 0 5 0 5 




P-G08 


SNWVCNLFKNQWFCNSYAPGGEGGGSK- CONH 2 


24 


P-A07 


SDWVCENKKDQWTCNLLAPGGEGGGSK- CONH 2 


25 


P-E01 


SNWDCMFGAEGWACSPWAPGGEGGGSK- CONH 2 


26 


P-B09 


SDWVCELTTGGYVCQPLAPGGEGGGSK- CONH 2 


27 



10 

As seen from Table 2, each of the peptides had an added N-terminal serine 
residue, replicating part of the context of the phage-display peptides. Each peptide was 
also provided with a C-terminal amide-functional linker useful for immobilizing peptides 
to various chromatographic substrates: -Ala 18 -Pro 19 -Gly 20 -Gly-Glu-Gly-Gly-Gly-Ser- 

15 Lys-CONH 2 (SEQ ID NO:28). The tripeptide -Ala 18 -Pro 19 -Gly 20 - replicates part of the 
context of the phage-displayed peptides, and the remainder of the C-terminus is a 
synthetic linker for immobilization. Aliquots of each peptide were fluorescently labeled 
using NHS-fluorescein reacted with the e-amino side group of the C-terminal lysine. 
Dissociation constants were determined using fluorescence anisotropy, through 

20 direct binding measurements and competition experiments. In direct binding assays, the 
concentration of the fluorescein-labeled peptide is held constant and the concentration of 
H6NA3 is varied. In the competition experiment, the concentration of the fluorescein- 
labeled peptide and the H6NA3 target are held constant and the concentration of a 
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competitor (cT84.66, unlabeled) is varied. The change in anisotropy is fit to the 
appropriate equation via nonlinear regression to obtain the apparent K d . The dissociation 
constants that describe binding of the four synthetic peptides for H6NA3 are set forth in 
Table 3. 



Table 3: 


Dissociation constants (K D ) for CEA binding peptides 


Polypeptide 


Amino Acid Sequence 


K D 

direct binding 


K D 

competition 


P-G08 


SEQIDNO:24 


1.9 ^iM 


(not done) 


P-A07 


SEQ ID NO: 25 


5.9 pM 


3.6 pM 


P-E01 


SEQ ID NO: 26 


6.9 \M 


5.3 


P-B09 


SEQ ED NO: 27 


6.0 pM 


1.0 nM 



These experiments show that the peptides bind CEA domain A3 with dissociation 
constants ranging from 1 to 7 |iM. From these tests, the polypeptide containing the G08 
sequence appears to be the highest affinity binder isolated. 

10 

Example 7: A library focused on improved CEA binding 

A second TN10 library, focused on improved binding peptides for CEA, was 
constructed, using sequence and binding information obtained in the previous examples. 
From the prevalence of sequences, the polypeptide A07 (SEQ ID NO: 5) appeared 

15 to be the best binder. Using this sequence as a secondary parental domain or template, 5 
amino acid positions within the A07 sequence were variegated. Five oligonucleotides 
were constructed that used A07 as the parental sequence and allowed five positions at a 
time to vary through all sequences that exclude cysteine. The oligonucleotide sequences 
thus encoded peptides having the designed sequences Varl through Var5 shown in Table 

20 4, below. After observing that the peptide G08 had the lowest K D , additional variegated 
oligonucleotides coding for peptides based on the parental sequence of G08 were 
designed and added to the focused library. The oligonucleotide sequences that encoded 
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G08-based peptides having the designed sequences Var6 and Var7 shown in Table 4, 
below. 



Table 4: Designed Polypeptides for Expression m Display Library "Lib2" 
X = any amino acid residue except Cys 


Encoded Peptides 


Ammo Acid Sequence 


QTtf \ 1 1 ^ XTfi* 
aJcA^J 1JJ INU. 


A07 (parental) 


DWVCENKKDQWTCNLL 


5 


Varl 


AAACAAlvKJjyVV 1 UNJ-Lu 




Var2 


DWVCXXXXXQWTCNLL 


30 


Var3 


DWVCENKXXXXXCNLL 


31 


Var4 


DWVCENKKDQXXCXXX 


32 


Var5 


DWVCEXXKXQWXCNXL 


33 


G08 (parental) 


NWVCNLFKNQWFCNSY 


4 


Var6 


NWVCXXXKXQWXCNSY 


34 


Var7 


XWXCNLFKNQWFCXXX 
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The oligonucleotides encoding the peptides Varl through Var7 were mixed in equal 
proportion and a secondary library ("Lib2"), allowing about 1.7 x 10 7 different 
sequences, was constructed. More than 5 x 10 8 transformants were obtained, so that all 
allowed sequences should be present. 

Example 8: Panning Lih2 and Isolates from Lib2 

Lib2 was panned through 3 rounds using lower amounts of target. After the third 
round, 4 x 96 isolates were test by ELISA for binding to NA3. Seventy-two isolates were 
ELISA positive and were sequenced. There were 71 distinct sequences, indicating that 
further screening may be needed to identify the very best sequences. The sequences are 
shown in Table 5, below. The seventy-one CEA binders of the Lib2 screen define a 
preferred family of binding moieties having a general formula: 
X-Trp-X 2 -<;ys-X 3 -X 4 ^^^ (SEQ ID NO:36), 

38 



wherein: 

X, is Asp, Asn, Ala, or He, with Asp most preferred; 
X 2 is Val, lie, Met, Tyr, Phe, Pro, or Asp, with Val most preferred; 
X 3 is Asn, Glu, or Asp, with Asn and Glu most preferred; 
5 X 4 is Leu, Phe, Tyr, Trp, Val, Met, He, or Asn, with Leu most preferred; 

X 5 is Phe, Leu, Asp, Glu, Ala, He, Lys, Asn, Ser, Val, Trp, or Tyr, with Phe most 
preferred; 

X 6 is Lys, Phe, Asp, Gly, Leu, Asn, or Trp, with Lys most preferred; 
X v is Asn, Pro, Phe, Gly, Asp, Ala, Ser, Glu, Gin, or Trp, with Asn most 
10 preferred; 

X 8 is Gin, or Lys, with Gin most preferred; 

Xg is Phe, Thr, Met, Ser, Ala, Asn, Val, His, He, Pro, Trp, or Tyr, with Phe most 
preferred; 

X 10 is Asn, Asp, Glu, Pro, Gin, or Ser, with Asn and Asp most preferred; 
15 Xj, is Val, Leu, lie, Pro, Ala, Gin, Ser, Met, Glu, Thr, Lys, or Trp, with Val and 

Leu most preferred; and 

X 12 is Leu, Met, Val, Tyr, Ala, lie, Trp, His, Pro, Gin, Glu, Phe, Lys, or Arg, with 
Leu most preferred. 
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SEQ ID 
NO: 




no 


ro 


cr* 
ro 


Sj* 


H 


CN 
si* 


ro 


«^ 

sj< 


lo 
si* 


st* 


st* 


st* 


DO 
St* 




o 
m 


H 

m 


CN 
LO 


ro 
m 


st* 

LD 


in 
m 


ID 

if) 


LO 


CO 
LO 


LO 




H 


S 


S 


S 


fa 




S 


H 


CM 


P4 


> 


> 


H 
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S 


a 


rH* 




rH* 






IS 


IS 






in 
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h^l 




O 


a 
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<C 
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►h 1 
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$3 


p 
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u 


U 


u 


u 


U 
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U 


U 
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U 


U 


U 


u 


U 


a 


U 
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u 
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IS 


IS 


IS 
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Amino Acid Position 
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Several isolates gave especially strong ELISA signals and these were tested 
further. Figure 2 shows the ability of TN10/9-G08, TN10/9-A07, 304A-15-E04, 304A- 
12-H12, 304A-14-B02, and 304A-14-A12 to bind to varying amounts of NA3. The 
binding of G08 and A07 drops off very quickly as the amount of NA3 drops below 50 ng 
5 per well. The binding of 304A-12-H12 and 304A-14-A12 do not drop off so quickly. 
The 304A-14-B02 and 304A-15-E04 show binding that is at least as good as G08 and 
A07. 

Figure 3 shows the abililty of TN10/9-G08, TN10/9-A07, 304A-15-E04, 304A- 
12-H12, 304A-14-B02, and 304A-14-A12 display phage to bind NA3 in the presence of a 
10 soluble peptide having the G08 sequence (inhibitor peptide, designated DX-207). Figure 
3 shows that phage displaying peptides A07 and G08 are unable to bind in the presence of 
2 |LtM inhibitor peptide, while 304A-12-H12, 304A-14-B02, and 304A-14-A12 show 
substantial binding in the presence of inhibitor. This indicates that these peptides have 
higher affinity for CEA domain A3 than do G08 or A07. 

15 

Although a number of embodiments and features have been described above, it 
will be understood by those skilled in the art that modifications and variations of the 
described embodiments and features may be made without departing from the present 
disclosure or the invention as defined in the appended claims. The publications cited 
20 herein are incorporated by reference. 
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CLAIMS 



A polypeptide having the ability to bind CEA comprising a CEA binding loop 
having the amino acid sequence Cys-X^Xj-X^-Xy-Xg-Xg-X^-Xn-^Cys (SEQ 
ID NO: 3), 
wherein 

X 4 is Asn, Glu, or Met; 

X 5 is Asn, Leu, Met or Phe; 

X 6 is Asp, Gly, He, Lys Phe or Thr; 

X 7 is Ala, Gin, Gly, Lys or Thr; 

X 8 is Arg, Asn, Asp, Glu or Gly; 

X 9 is Gin, Gly or Leu; 

X 10 is Ala, Trp or Tyr; 

X„ is Ala, Gly, His, Phe, Thr or Val; 

A polypeptide according to Claim 1, wherein 
X 4 is Glu; 

X 5 is Asn, Leu, Met or Phe; 

X 6 is Asp, Gly, lie, Lys, Phe or Thr; 

X 7 is Lys; 

X 8 is Arg, Asn, Asp, Glu or Gly; 
X 9 is Gin; 
X 10 is Trp; and 

X„ is Ala, Gly, His, Phe, Thr or Val. 

A polypeptide according to Claim 1, comprising the amino acid sequence: 
X^-X^-Cys-X^Xs^ (SEQ ID 

NO:l), 
wherein 
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X] is Asn or Asp; 
X 2 is Trp; 

X 3 is Asp, Phe or Val; 

X 4 is Asn, Glu or Met; 

X 5 is Asn, Leu, Met or Phe; 

X 6 is Asp, Gly, He, Lys, Phe or Thr; 

X 7 is Ala, Gin, Gly, Lys or Thr; 

X 8 is Arg, Asn, Asp, Glu or Gly; 

X 9 is Gin, Gly or Leu; 

X 10 is Ala, Trp or Tyr; 

X„ is Ala, Gly, His, Phe, Thr or Val; 

X 12 is Asn, Gin, Phe, Ser or Val; 

X 13 is Arg, Leu, Pro or Ser; and 

X 14 is Leu, Ser, Trp or Tyr. 

4. A polypeptide according to Claim 3, having the amino acid sequence: 
Xl -Trp-Val-Cys-Glu-X 5 -X 6 -Lys- X 8 -Gln-Trp- X n -Cys-Asn-X 13 -X 14 (SEQ 
ID NO:2), wherein 

Xj is Asn or Asp; 

X 5 is Asn, Leu, Met or Phe; 

X 6 is Asp, Gly, He, Lys, Phe or Thr; 

X 8 is Arg, Asn, Asp, Glu or Gly; 

X n is Ala, Gly, His, Phe, Thr or Val; 

X 13 is Arg, Leu, Pro or Ser; and 

X 14 is Leu or Tyr. 

5. A polypeptide according to Claim 3, comprising an amino acid sequence selected 
from the group consisting of: 
Asn-Trp-Val^ys-Asn-Leu-Phe-L^ 

(SEQ ID NO:4), 



45 



Asp-Trp-Val-Cys-Glu-Asn-Lys-Lys-Asp-Gln-Trp-Thr-Cys-Asn-Leu-Leu 
(SEQ ID N0:5), 

Asn-Trp-Asp-Cys-Met-Phe-Gly-Ala-Glu-Gly-Trp-Ala-Cys-Ser-Pro-Trp 
(SEQIDNO:6); 

Asp-Trp-Val^ys-Glu-Lys-Thr-Thr-Gly-Gly-Tyr-Val^ys-<jln-Pro-Leu 
(SEQIDNO:7); 

Asn-Trp-Phe^ys^lu-Met-Ile^Gly-Arg-Gln-Trp-Gly-Cys-Val-Pro-Ser 
(SEQEDNO:8);and 

Asp-Trp-Val-^ys-Asn-Phe^-Asp-Gln--Gly-Leu-Ala-His-Cys-Phe-Pro-Ser 
(SEQ ED NO:9). 

A polypeptide having the ability to bind CEA comprising the amino acid 
sequence: X^X^-Cys-X^Xs-Xg-Xv-Xg-X^Xjo-Xn-Cys-X^-X^-X^ 
(SEQ ID NO: 1), 
wherein 

X, is Asp, Asn, Ala, or lie; 

X 2 is Trp; 

X 3 is Val, lie, Met, Tyr, Phe, Pro, or Asp; 
X 4 is Asn, Glu, or Asp; 

X 5 is Leu, Phe, Tyr, Trp, Val, Met, He, or Asn; 

Xg is Phe, Leu, Asp, Glu, Ala, He, Lys, Asn, Ser, Val, Trp, or Tyr; 

X 7 is Lys, Phe, Asp, Gly, Leu, Asn, or Trp; 

X 8 is Asn, Pro, Phe, Gly, Asp, Ala, Ser, Glu, Gin, or Trp; 

X, is Gin, or Lys; 

X, 0 is Trp; 

X n is Phe, Thr, Met, Ser, Ala, Asn, Val, His, lie, Pro, Trp, or Tyr; 
X 12 is Asn, Asp, Glu, Pro, Gin, or Ser; 

X 13 is Val, Leu, He, Pro, Ala, Gin, Ser, Met, Glu, Thr, Lys, or Trp; and 
X 14 is Leu, Met, Val, Tyr, Ala, He, Trp, His, Pro, Gin, Glu, Phe, Lys, or 
Arg. 
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7. A polypeptide according to Claim 6, comprising an amino acid sequence as 
depicted in Table 5 (SEQ ID NOs:37-107). 

8. A polypeptide according to Claim 1, 3 or 6, wherein said polypeptide binds to 
CEA but does not bind to NCA. 

9. A polypeptide according to Claim 1, 3 or 6, wherein said polypeptide has a K d for 
CEA which less than 7 jaM. 

10. A method of detecting CEA in a subject comprising the steps of: 

a) detectably labeling a polypeptide according to any one of Claims 1-7; 

b) administering to said subject the labeled polypeptide; and, thereafter, 

c) detecting the labeled polypeptide in the subject. 

1 1 . The method according to Claim 10, wherein said polypeptide is labeled with a 
radioactive compound. 

12. The method according to Claim 1 1, wherein said radioactive compound includes 
indium. 

13. The method according to Claim 1 1, wherein said radioactive compound includes 
technetium. 

14. The method according to Claim 10, wherein said detecting step is indicative of 
colon cancer, breast cancer, lung cancer, cervical cancer, ovarian cancer, stomach 
cancer, bladder cancer, pancreatic cancer or esophageal cancer. 
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15. A method of treating a CEA associated disease comprising the step of: 
administering to a subject in need of treatment for such a disease a composition 
comprising a polypeptide according to any one of Claims 1-7 conjugated with a 
therapeutic agent effective for treating said disease. 

16. The method according to Claim 15, wherein said CEA associated disease is colon 
cancer, breast cancer, lung cancer, cervical cancer, ovarian cancer, stomach 
cancer, bladder cancer, pancreatic cancer or esophageal cancer. 

1 7. The method according to Claim 15, wherein said therapeutic agent is a radioactive 
agent. 

18. The method according to Claim 15, wherein said therapeutic agent is a 
chemotherapeutic agent. 

19. The method according to Claim 15, wherein said therapeutic agent is a toxin or 
enzyme. 

20. A recombinant bacteriophage expressing exogenous DNA encoding a CEA 
binding polypeptide having an amino acid sequence comprising: 
X 1 -X 2 -X 3 -Cys-X 4 -X 5 -X 6 -X 7 -X 8 -X 9 -X 10 -X n -Cys-X 12 -X 13 -X 14 (SEQ ID 
NO:l), 

wherein 

X x is Asp, Asn, Ala, or He; 
X 2 is Trp; 

X 3 is Val, He, Met, Tyr, Phe, Pro, or Asp; 
X 4 is Asn, Glu, or Asp; 

X 5 is Leu, Phe, Tyr, Trp, Val, Met, He, or Asn; 

Xg is Phe, Leu, Asp, Glu, Ala, He, Lys, Asn, Ser, Val, Trp, or Tyr; 

X 7 is Lys, Phe, Asp, Gly, Leu, Asn, or Trp; 



48 



* 



X 8 is Asn 5 Pro, Phe, Gly, Asp, Ala, Ser, Glu, Gin, or Trp; 
X 9 is Gin, or Lys; 
X l0 is Trp; 

X u is Phe, Thr, Met, Ser, Ala, Asn, Val, His, He, Pro, Trp, or Tyr; 
X }2 is Asn, Asp, Glu, Pro, Gin, or Ser; 

X 13 is Val, Leu, He, Pro, Ala, Gin, Ser, Met, Glu, Thr, Lys, or Trp; and 
X 14 is Leu, Met, Val, Tyr, Ala, lie, Trp, His, Pro, Gin, Glu, Phe, Lys, or 
Arg 

and wherein said binding polypeptide is displayed on the surface of said 
bacteriophage. 

21 . A recombinant bacteriophage according to Claim 20, expressing exogenous DNA 
encoding an amino acid sequence selected from the group consisting of: 

Asn-Trp-Val-Cys-Asn-Leu-Phe-Lys-Asn-Gln-Trp-Phe-Cys-Asn- 
Ser-Tyr (SEQ ID NO:4), and 

Asp-Trp-Val^ys-Glu-Asn-Lys-Lys-Asp-Gln-Trp-Thr-Cys-Asn~ 
Leu-Leu (SEQ ID NO:5). 



22. A recombinant bacteriophage according to Claim 20, expressing exogenous DNA 
encoding an amino acid sequence selected from the group of sequences depicted 
in Table 5 (SEQ ID NOs: 37-107). 
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ABSTRACT 

The present invention provides binding moieties for CEA, which have a variety of 
uses wherever detecting, isolating or localizing CEA, and particularly CEA as opposed to 
cross-reactive antigens such as NCA, is advantageous. Particularly disclosed are 
synthetic, isolated polypeptides capable of binding CEA, which is overexpressed in 
adenocarcinomas of endodermally derived digestive system epithelia and fetal colon. 
Such polypeptides and disclosed derivatives are useful, e.g., as imaging agents for CEA- 
expressing tumors. 
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SEQUENCE LISTING 

<110> Rondon, Isaac J 
Ladner, Robert C 

<120> BINDING PEPTIDES FOR CARCINOEMBRYONIC ANTIGEN (CEA) 

<130> Sequence Listing DYX-016.0 ITS 

<140> (not yet assigned) 
<141> 2000-04-03 

<160> 107 

<170> Patentln Ver. 2,1 

<210> 1 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<220> 

<221> VARIANT 
<222> (1) 

<223> Xaa is Asn, Asp or is absent 
<220> 

<221> VARIANT 

<222> (2) 

<223> Xaa is Trp 

<220> 

<221> VARIANT 
<222> (3) 

<223> Xaa is Asp, Phe or Val 
<220> 

<221> VARIANT 
<222> (5) 

<223> Xaa is Asn, Glu or Met 
<220> 

<221> VARIANT 
<222> (6) 
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<223> Xaa is Asn, Leu, Met or Phe 
<220> 

<221> VARIANT 
<222> (7) 

<223> Xaa is Asp, Gly, lie, Lys, Phe or Thr 
<220> 

<221> VARIANT 
<222> (8) 

<223> Xaa is Ala, Gin, gly Lys or Thr 
<220> 

<221> VARIANT 
<222> (9) 

<223> Xaa is Arg, Asn, Asp, Glu or Gly 
<220> 

<221> VARIANT 
<222> (10) 

<223> Xaa is Gin, Leu or Gly 
<220> 

<221> VARIANT 
<222> (11) 

<223> Xaa is Ala, Trp or Tyr 

<220> 

<221> VARIANT 
<222> (12) 

<223> Xaa is Ala, Gly, His, Phe, Thr or Val 
<220> 

<221> VARIANT 
<222> (14) 

<223> Xaa is Asn, Gin, Phe, Ser or Val 
<220> 

<221> VARIANT 
<222> (15) 

<223> Xaa is Arg, Leu Pro or Ser 
<220> 

<221> VARIANT 
<222> (16) 

<223> Xaa is Leu, Ser, Trp or Tyr 
<400> 1 
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Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa 
15 10 15 



<210> 2 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: family of 
preferred CEA binding moieties 

<220> 

<221> VARIANT 
<222> (1) 

<223> Xaa is Asn or Asp 
<220> 

<221> VARIANT 
<222> (6) 

<223> Xaa is Ph, Met, Leu or Asn 
<220> 

<221> VARIANT 
<222> (7) 

<223> Xaa is Asp, Gly, lie, Lys, Phe or Thr 
<220> 

<221> VARIANT 
<222> (9) 

<223> Xaa is Arg, Asn, Asp, Glu, Gly or Trp 
<220> 

<221> VARIANT 
<222> (12) 

<223> Xaa is Ala, Gly, His, Phe, Thr, Tyr or Val 
<220> 

<221> VARIANT 
<222> (15) 

<223> Xaa is Arg, Leu, Pro or Ser 
<220> 

<221> VARIANT 
<222> (16) 

<223> Xaa is Leu, Ser, Trp or Tyr 
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<400> 2 

Xaa Trp Val Cys Glu Xaa Xaa Lys Xaa Gin Trp Xaa Cys Asn Xaa Xaa 
15 10 15 



<210> 3 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
loop 

<220> 

<221> VARIANT 
<222> (2) 

<223> Xaa is Asn, Glu or Met 
<220> 

<221> VARIANT 
<222> (3) 

<223> Xaa is Asn, Leu, Net or Phe 
<220> 

<221> VARIANT 
<222> (4) 

<223> Xaa is Asp, Gly, lie, Lys, Phe or Thr 
<220> 

<221> VARIANT 
<222> (5) 

<223> Xaa is Ala, Gin, Gly, Lys or Thr 
<220> 

<221> VARIANT 
<222> (6) 

<223> Xaa is Arg, Asn, Asp, Glu or Gly 
<220> 

<221> VARIANT 
<222> (7) 

<223> Xaa is Gin, Gly or Leu 
<220> 

<221> VARIANT 
<222> (8) 

<223> Xaa is Ala, Trp or Tyr 
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<220> 

<221> VARIANT 
<222> (9) 

<223> Xaa is Ala, Gly, His, Phe, Thr or Val 
<400> 3 

Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 
15 10 



<210> 4 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 4 

Asn Trp Val Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asn Ser Tyr 
15 10 15 



<210> 5 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 5 

Asp Trp Val Cys Glu Asn Lys Lys Asp Gin Trp Thr Cys Asn Leu Leu 
15 10 15 



<210> 6 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 
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<400> 6 

Asn Trp Asp Cys Met Phe Gly Ala Glu Gly Trp Ala Cys Ser Pro Trp 
15 10 15 



<210> 7 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 7 

Asp Trp Val Cys Glu Lys Thr Thr Gly Gly Tyr Val Cys Gin Pro Leu 
15 10 15 



<210> 8 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 8 

Asn Trp Phe Cys Glu Met lie Gly Arg Gin Trp Gly Cys Val Pro Ser 
15 10 15 



<210> 9 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 9 

Asp Trp Val Cys Asn Phe Asp Gin Gly Leu Ala His Cys Phe Pro Ser 
15 10 15 



<210> 10 
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<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: parental 

domain for design of microprotein display library 

<220> 

<221> VARIANT 
<222> (1) . • (12) 

<223> amino acid positions 4 and 9 are invariant Cys; 

all other positions Xaa are varied but not Cys, to 
provide a library of 2x10(8) different peptides 
based on the template sequence 

<400> 10 

Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa 
15 10 



<210> 11 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: parental 

domain for design of microprotein display library 

<220> 

<221> VARIANT 
<222> (1) • . (ID 

<223> amino acid positions 3 and 9 are invariant Cys; 

all other positions Xaa are varied but not Cys, to 
provide a library of 1x10(9) different peptides 
based on the template sequence 

<400> 11 

Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa 
15 10 



<210> 12 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: parental 

domain for design of microprotein display library 

<220> 

<221> VARIANT 
<222> (1) *. (12) 

<223> amino acid positions 3 and 10 are invariant Cys; 

all other positions Xaa are varied but not Cys, to 
provide a library of 1x10(9) different peptides 
based on the template sequence 

<400> 12 

Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa 
15 10 



<210> 13 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: parental 

domain for design of microprotein display library 

<220> 

<221> VARIANT 
<222> (1)..(16) 

<223> amino acid positions 4 and 13 are invariant Cys; 

all other positions Xaa are varied but not Cys, to 
provide a library of 2.5x10(8) different peptides 
based on the template sequence 

<400> 13 

Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa 
15 10 15 



<210> 14 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: variable 
sublibrary sequence used in designing focused 
secondary library 
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<220> 

<221> VARIANT 
<222> (1) (3) 

<223> Xaa is any amino acid except Cys 
<220> 

<221> VARIANT 
<222> (5) . . (6) 

<223> Xaa is any amino acid except Cys 
<400> 14 

Xaa Xaa Xaa Cys Xaa Xaa Lys Lys Asp Gin Trp Thr Cys Asn Leu Leu 
1 5 10 15 



<210> 15 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: variable 
sublibrary sequence used in designing focused 
secondary library 

<220> 

<221> VARIANT 
<222> (5) . • (9) 

<223> Xaa is any amino acid except Cys 
<400> 15 

Asp Trp Val Cys Xaa Xaa Xaa Xaa Xaa Gin Trp Thr Cys Asn Leu Leu 
X 5 10 15 



<210> 16 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: variable 
sublibrary sequence used in designing focused 
secondary library 

<220> 

<221> VARIANT 
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<222> (8) (12) 

<223> Xaa is any amino acid except Cys 
<400> 16 

Asp Trp Val Cys Glu Asn Lys Xaa Xaa Xaa Xaa Xaa Cys Asn Leu Leu 
15 10 15 



<210> 17 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: variable 
sublibrary sequence used in designing focused 
secondary library 

<220> 

<221> VARIANT 
<222> (11) . . (12) 

<223> Xaa is any amino acid except Cys 
<220> 

<221> VARIANT 
<222> (14) • . (16) 

<223> Xaa is any amino acid except Cys 
<400> 17 

Asp Trp Val Cys Glu Asn Lys Lys Asp Gin Xaa Xaa Cys Xaa Xaa Xaa 
1 5 10 15 



<210> 18 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: variable 
sublibrary sequence used in designing focused 
secondary library 

<220> 

<221> VARIANT 
<222> (6) . . (7) 

<223> Xaa is any amino acid except Cys 
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<220> 

<221> VARIANT 
<222> (9) 

<223> Xaa is any amino acid except Cys 
<220> 

<221> VARIANT 
<222> (12) 

<223> Xaa is any amino acid except Cys 
<220> 

<221> VARIANT 
<222> (15) 

<223> Xaa is any amino acid except Cys 
<400> 18 

Asp Trp Val Cys Glu Xaa Xaa Lys Xaa Gin Trp Xaa Cys Asn Xaa Leu 
15 10 15 



<210> 19 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: variable 
sublibrary sequence used in designing focused 
secondary library 

<220> 

<221> VARIANT 
<222> (5) . . (7) 

<223> Xaa is any amino acid except Cys 
<220> 

<221> VARIANT 
<222> (9) 

<223> Xaa is any amino acid except Cys 
<220> 

<221> VARIANT 
<222> (12) 

<223> Xaa is any amino acid except Cys 
<400> 19 

Asn Trp Val Cys Xaa Xaa Xaa Lys Xaa Gin Trp Xaa Cys Asn Ser Tyr 
15 10 15 
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<210> 20 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: variable 
sub library sequence used in designing focused 
secondary library 

<220> 

<221> VARIANT 
<222> (1) 

<223> Xaa is any amino acid except Cys 
<220> 

<221> VARIANT 
<222> (3) 

<223> Xaa is any amino acid except Cys 
<220> 

<221> VARIANT 
<222> (14) • . (16) 

<223> Xaa is any amino acid except Cys 
<400> 20 

Xaa Trp Xaa Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Xaa Xaa Xaa 
15 10 15 



<210> 21 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: isolate of 
TN10/9 library found not to bind CEA 

<400> 21 

Asn Trp Arg Cys Lys Leu Phe Pro Arg Tyr Pro Tyr Cys Ser Ser Trp 
15 10 15 



<210> 22 
<211> 15 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: isolate of 
TN10/9 library found not to bind CEA 

<400> 22 

Arg Tyr Cys Glu Phe Phe Pro Trp Ser Leu His Cys Gly Arg Pro 
15 10 15 



<210> 23 

<211> 16 

<212> PRT 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: conserved 
amino acid positions in first family of CEA 
binding peptides 



<220> 

<221> VARIANT 
<222> (6) 

<223> X is Asn, Leu, Met or Phe 



<220> 

<221> VARIANT 
<222> (7) 

<223> X is Asp, Gly, lie, Lys, Phe or Thr 



<220> 

<221> VARIANT 
<222> (9) 

<223> X is Arg, Asn, Asp, Glu or Gly 



<220> 

<221> VARIANT 
<222> (12) 

<223> X is Ala, Gly, His, Phe, Thr or Val 



<220> 

<221> VARIANT 
<222> (15) 

<223> X is Arg, Leu, Pro or Ser 



<400> 23 
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Asp Trp Val Cys Glu Xaa Xaa Lys Xaa Gin Trp Xaa Cys Asn Xaa Leu 
15 10 15 



<210> 24 
<211> 27 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic CEA 
binding peptide with C- terminal immobilization 
sequence 

<400> 24 

Ser Asn Trp Val Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asn Ser 
15 10 15 

Tyr Ala Pro Gly Gly Glu Gly Gly Gly Ser Lys 
20 25 



<210> 25 
<211> 27 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic CEA 
binding peptide with C- terminal immobilization 
sequence 

<400> 25 

Ser Asp Trp Val Cys Glu Asn Lys Lys Asp Gin Trp Thr Cys Asn Leu 
15 10 15 

Leu Ala Pro Gly Gly Glu Gly Gly Gly Ser Lys 
20 25 



<210> 26 
<211> 27 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic CEA 
binding peptide with C- terminal immobilization 
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sequence 



<400> 26 

Ser Asn Trp Asp Cys Met Phe Gly Ala Glu Gly Trp Ala Cys Ser Pro 
15 10 15 

Trp Ala Pro Gly Gly Glu Gly Gly Gly Ser Lys 
20 25 



<210> 27 
<211> 27 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic CEA 
binding peptide with O terminal immobilization 
sequence 

<400> 27 

Ser Asp Trp Val Cys Glu Leu Thr Thr Gly Gly Tyr Val Cys Gin Pro 
15 10 15 

Leu Ala Pro Gly Gly Glu Gly Gly Gly Ser Lys 
20 25 

<210> 28 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: C- terminal 
sequence for immobilizing peptides 

<400> 28 

Ala Pro Gly Gly Glu Gly Gly Gly Ser Lys 
15 10 

<210> 29 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequences template 

sequence for sublibrary used in construction of 
focused secondary display library 

<220> 

<221> VARIANT 
<222> (1)..(3) 

<223> X is any amino acid except Cys 
<220> 

<221> VARIANT 
<222> (5) . . (6) 

<223> X is any amino acid except Cys 
<400> 29 

Xaa Xaa Xaa Cys Xaa Xaa Lys Lys Asp Gin Trp Thr Cys Asn Leu Leu 
15 10 15 



<210> 30 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: template 

sequence for sublibrary used in construction of 
focused secondary display library 

<220> 

<221> VARIANT 
<222> (5) . . (9) 

<223> X is any amino acid except Cys 
<400> 30 

Asp Trp Val Cys Xaa Xaa Xaa Xaa Xaa Gin Trp Thr Cys Asn Leu Leu 
15 10 15 



<210> 31 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: template 

sequence for sublibrary used in construction of 
focused secondary display library 



16 



<220> 

<221> VARIANT 
<222> (8) -.(12) 

<223> X is any amino acid except Cys 
<400> 31 

Asp Trp Val Cys Glu Asn Lys Xaa Xaa Xaa Xaa Xaa Cys Asn Leu Leu 
15 10 15 



<210> 32 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: template 

sequence for sublibrary used in construction of 
focused secondary display library 

<220> 

<221> VARIANT 
<222> (11) . . (12) 

<223> X is any amino acid except Cys 
<220> 

<221> VARIANT 
<222> (14).. (16) 

<223> X is any amino acid except Cys 
<400> 32 

Asp Trp Val Cys Glu Asn Lys Lys Asp Gin Xaa Xaa Cys Xaa Xaa Xaa 
15 10 15 



<210> 33 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: template 

sequence for sublibrary used in construction of 
focused secondary display library 

<220> 

<221> VARIANT 
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<222> (6) -.(7) 

<223> X is any amino acid except Cys 
<220> 

<221> VARIANT 
<222> (9) 

<223> X is any amino acid except Cys 
<220> 

<221> VARIANT 
<222> (12) 

<223> X is any amino acid except Cys 
<220> 

<221> VARIANT 
<222> (15) 

<223> X is any amino acid except Cys 
<400> 33 

Asp Trp Val Cys Glu Xaa Xaa Lys Xaa Gin Trp Xaa Cys Asn Xaa Leu 
15 10 15 



<210> 34 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: template 

sequence for sublibrary used in construction of 
focused secondary display library 

<220> 

<221> VARIANT 
<222> (5).. (7) 

<223> X is any amino acid except Cys 
<220> 

<221> VARIANT 
<222> (9) 

<223> X is any amino acid except Cys 
<220> 

<221> VARIANT 
<222> (12) 

<223> X is any amino acid except Cys 
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<400> 34 

Asn Trp Val Cys Xaa Xaa Xaa Lys Xaa Gin Trp Xaa Cys Asn Ser Tyr 
15 10 15 



<210> 35 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: template 

sequence for sublibrary used in construction of 
focused secondary display library 

<220> 

<221> VARIANT 
<222> (1) 

<223> X is any amino acid except Cys 
<220> 

<221> VARIANT 
<222> (3) 

<223> X is any amino acid except Cys 
<220> 

<221> VARIANT 
<222> (14)-. (16) 

<223> X is any amino acid except Cys 
<400> 35 

Xaa Trp Xaa Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Xaa Xaa Xaa 
15 10 15 



<210> 36 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: family of CEA 
binding polypeptides 

<220> 

<221> VARIANT 
<222> (1) 

<223> Xaa is Asp, Asn, Ala or lie 
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<220> 

<221> VARIANT 
<222> (3) 

<223> Xaa is Val, He, Met, Tyr, Phe, Pro or Asp 
<220> 

<221> VARIANT 
<222> (5) 

<223> Xaa is Asn, Glu or Asp 
<220> 

<221> VARIANT 
<222> (6) 

<223> Xaa is Leu, Phe, Typ, Trp, Val Met, He or Asn 
<220> 

<221> VARIANT 
<222> (7) 

<223> Xaa is Phe, Leu, Asp, Glu, Ala, He, Lys, Asn, 
Ser, Val, Trp or Tyr 

<220> 

<221> VARIANT 
<222> (8) 

<223> Xaa is Lys, Phe, Asp, Gly, Leu, Asn or Trp 
<220> 

<221> VARIANT 
<222> (9) 

<223> Xaa is Asn, Pro, Phe, Gly, Asp, Ala, Ser, Glu, Gin 
or Trp 

<220> 

<221> VARIANT 
<222> (10) 

<223> Xaa is Gin or Lys 
<220> 

<221> VARIANT 
<222> (12) 

<223> Xaa is Phe, Thr, Met, Ser, Ala, Asn, Val, His, 
He, Pro, Trp or Tyr 

<220> 

<221> VARIANT 
<222> (14) 

<223> Xaa is Asn, Asp, Glu, Pro, Gin or Ser 
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<220> 

<221> VARIANT 
<222> (15) 

<223> Xaa is Val # Leu, He, Pro, Ala, Gin, Ser, Met, 
Glu,Thr, Lys or Trp 

<220> 

<221> VARIANT 
<222> (16) 

<223> Xaa is Leu, Met, Val, Tyr, Ala, He, Trp, His, 
Pro, Gin, Glu, Phe, Lys or Arg 

<400> 36 

Xaa Trp Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Trp Xaa Cys Xaa Xaa Xaa 
15 10 15 



<210> 37 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 37 

Asp Trp Met Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Leu Met 
15 10 15 



<210> 38 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 38 

Asp Trp Val Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Leu Met 
15 10 15 



<210> 39 
<211> 16 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 39 

Asp Trp He Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Gin Met 
15 10 15 



<210> 40 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 40 

Asn Trp He Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Gin Glu 
15 10 15 



<210> 41 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 41 

Asp Trp He Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Gin Val Lys 
15 10 15 



<210> 42 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 
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<400> 42 

Asp Trp Val Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Val Met 
15 10 15 



<210> 43 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 43 

Asp Trp Met Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Gin lie 
15 10 15 



<210> 44 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 44 

lie Trp Asp Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Pro Ala Pro 
15 10 15 



<210> 45 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 45 

Asp Trp lie Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp lie Arg 
15 10 15 
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<210> 46 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 46 

Asp Trp Met Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Val Val 
15 10 15 



<210> 47 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 47 

Asp Trp lie Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Ala lie 
15 10 15 



<210> 48 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 48 

Asp Trp lie Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Met Ala 
15 10 15 



<210> 49 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 49 

Asp Trp Val Cys Glu Phe Leu Lys Met Gin Trp Ala Cys Asn Val Leu 
15 10 15 



<210> 50 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 50 

Asp Trp Val Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asn Val Met 
15 10 15 



<210> 51 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 51 

Ala Trp Pro Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Pro Pro Gin 
15 10 15 



<210> 52 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 52 

Asp Trp Val Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Val Leu 
15 10 15 
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<210> 53 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 53 

Asp Trp Val Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Lys Trp 
15 10 15 



<210> 54 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; CEA binding 
polypeptide 

<400> 54 

Asp Trp Val Cys Glu Trp Leu Lys Met Gin Trp Ala Cys Asn Met Leu 
15 10 15 



<210> 55 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 55 

Asp Trp Val Cys Asp Phe Phe Phe Asn Gin Trp Thr Cys Asn Leu Leu 
15 10 15 



<210> 56 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 56 

Asp Trp Val Cys Glu Met Phe Lys Ala Gin Trp Phe Cys Asn Ala Leu 
15 10 15 



<210> 57 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 57 

Asp Trp He Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Ala Trp 
15 10 15 



<210> 58 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 58 

Asp Trp Val Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Val Trp 
15 10 15 



<210> 59 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 59 
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Asp Trp Val Cys Glu Tyr Phe Lys Asn Gin Trp Phe Cys Asn Val Leu 
15 10 15 



<210> 60 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 60 

Asp Trp Val Cys Glu He Asp Lys Gly Gin Trp Thr Cys Asn Pro Leu 
15 10 15 

<210> 61 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 61 

Asp Trp Val Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asn Pro Phe 
15 10 15 

<210> 62 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 62 

Asp Trp Val Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Val Gin 
15 10 15 

<210> 63 
<211> 16 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 63 

Asp Trp Val Cys Asn Leu Phe Phe Gly Gin Trp Thr Cys Asn Leu Leu 
15 10 15 

<210> 64 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 64 

Asp Trp lie Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Glu Ala His 
15 10 15 

<210> 65 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 65 

Asp Trp Val Cys Glu Leu Val Lys Ala Gin Trp Tyr Cys Asn He Leu 
15 10 15 

<210> 66 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 
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<400> 66 

Asn Trp Val Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Thr Val 
15 10 15 

<210> 67 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 67 

Asp Trp Val Cys Glu Phe Tyr Lys Ser Gin Trp Asn Cys Asn lie Leu 
15 10 15 

<210> 68 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 68 

Asp Trp Val Cys Glu Trp Phe Lys Pro Gin Trp Phe Cys Asn Pro Leu 
15 10 15 

<210> 69 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 69 

Asp Trp Tyr Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Val Leu 
15 10 15 
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<210> 70 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 70 

Asp Trp Val Cys Glu Tyr Asn Asp Glu Gin Trp Thr Cys Asn Leu Leu 
15 10 15 

<210> 71 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 71 

Asp Trp lie Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asn Glu Ala 
15 10 15 

<210> 72 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 72 

Asp Trp Val Cys Asn Trp Glu Leu Phe Gin Trp Thr Cys Asn Leu Leu 
15 10 15 

<210> 73 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 73 

Asp Trp Val Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Gin Val 
15 10 15 



<210> 74 

<211> 16 

<212> PRT 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 74 

Asp Trp Val Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Val Pro 
15 10 15 



<210> 75 

<211> 16 

<212> PRT 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 75 

Asp Trp Val Cys Glu Phe Phe Lys Gin Gin Trp Phe Cys Asn Val Leu 
15 10 15 



<210> 76 

<211> 16 

<212> PRT 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 76 

Asp Trp Val Cys Glu Phe Phe Lys Asp Gin Trp Ser Cys Asn Val Leu 
15 10 15 
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<210> 77 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 77 

Asp Trp Val Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Ser Leu 
15 10 15 

<210> 78 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 78 

Asp Trp Val Cys Glu Phe Met Lys His Gin Trp Phe Cys Asn Pro Leu 
15 10 15 

<210> 79 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 79 

Asp Trp lie Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Gin Ala Val 
15 10 15 

<210> 80 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 80 

Asp Trp Val Cys Glu Phe lie Lys Asn Gin Trp Met Cys Asn Val Leu 
15 10 15 

<210> 81 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 81 

Asp Trp Val Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Ala Leu 
15 10 15 

<210> 82 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 82 

Asp Trp Val Cys Glu Tyr Glu Lys Asp Gin Trp Ser Cys Asn lie Leu 
15 10 15 

<210> 83 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 83 
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Asp Trp Val Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Thr Leu 
15 10 15 

<210> 84 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 84 

Asp Trp Tyr Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Val Tyr 
15 10 15 

<210> 85 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 85 

Asp Trp Phe Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Ser Pro lie 
15 10 15 

<210> 86 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 86 

Asp Trp Val Cys Glu Phe Phe Lys Lys Gin Trp Phe Cys Asn Leu Leu 
15 10 15 

<210> 87 
<211> 16 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 87 

Asn Trp Val Cys Asp Val Leu Lys Trp Gin Trp Pro Cys Asn Ser Tyr 
15 10 15 

<210> 88 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 88 

Asp Trp Val Cys Glu Tyr Asp Lys Gly Gin Trp His Cys Asn He Leu 
15 10 15 

<210> 89 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 89 

Asp Trp He Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Gin Gin His 
15 10 15 

<210> 90 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 
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<400> 90 

Asp Trp Val Cys Asn Trp Leu Trp Gly Gin Trp Thr Cys Asn Leu Leu 
15 10 15 



<210> 91 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 91 

Asp Trp Val Cys Glu Met Phe Lys Lys Gin Trp Val Cys Asn Pro Leu 
15 10 15 



<210> 92 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 92 

Asp Trp lie Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Gly Pro Leu 
15 10 15 



<210> 93 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 93 

Asp Trp Val Cys Glu Val He Lys Asp Gin Trp Val Cys Asn Pro Leu 
15 10 15 
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<210> 94 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 94 

Asp Trp Val Cys Glu Asn Lys Asn Phe Lys Trp Phe Cys Asn Leu Leu 
15 10 15 

<210> 95 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 95 

Asp Trp Val Cys Glu Tyr Ala Lys Asn Gin Trp Asn Cys Asn Pro Leu 
15 10 15 

<210> 96 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 96 

Asn Trp Val Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Glu Trp Ala 
15 10 15 

<210> 97 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 97 

Asn Trp Val Cys Asp Tyr Trp Lys Pro Gin Trp Phe Cys Asn Ser Tyr 
15 10 15 

<210> 98 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 98 

Asp Trp Tyr Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Leu Val 
15 10 15 

<210> 99 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 99 

Asn Trp Val Cys Asn Leu Phe Lys Asn Gin Trp Phe Cys Asp Glu Met 
15 10 15 

<210> 100 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 100 

Asp Trp Val Cys Glu Leu Phe Lys Pro Gin Trp Phe Cys Asn lie Leu 
15 10 15 
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<210> 101 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 101 

Asp Trp Val Cys Glu Trp Ser Lys Met Gin Trp Ser Cys Asn Ala Leu 
15 10 15 

<210> 102 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 102 

Asp Trp Val Cys Asp Tyr Lys Phe Phe Gin Trp Thr Cys Asn Leu Leu 
15 10 15 

<210> 103 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 103 

Asn Trp Val Cys Glu Trp Leu Lys Pro Gin Trp Trp Cys Asn Ser Tyr 
15 10 15 

<210> 104 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 104 

Asp Trp Val Cys Glu Phe Phe Lys Pro Gin Trp Met Cys Asn lie Leu 
15 10 15 

<210> 105 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 105 

Asp Trp Val Cys Glu Tyr Phe Lys Ser Gin Trp Met Cys Asn Met Leu 
15 10 15 

<210> 106 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 106 

Asp Trp Val Cys Glu Phe Phe Gly Met Gin Trp Thr Cys Asn Leu Leu 
15 10 15 

<210> 107 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEA binding 
polypeptide 

<400> 107 
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Asp Trp Val Cys 61u Tyr Ala Lys Phe Gin Trp He Cys Asn lie Leu 
15 10 15 
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